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0 Nucleic acid encoding the alpha or beta chains of Inhibin and method for synthealzing 
polypeptides using such nucleic acid. 



@ ONA encoding the prepro inhibm a and chains 
has t>een isolated. This ONA is ligated into expres- 
sion vectors and used to transform host calls for the 
preparation of inhibin or activin. Also provided are 
prohormone domains and other inhibtns a or 0 chain 
derivatives having therapeutic or diagnostic interest 
The compositions provided herein are useful m the 

^manipulation of fertility in antmais. 
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NUCLBC ACfD ENCODING THE a OR 0 CHAINS OF INHIBIN AND METHOD FOR SYNTHESIZING 
POLYPEPTIDES USING SUCH NUCUaC ACID 



BACKGROUND 

TViis Invention relates to methods for making 
proteins in recombinant ceil culture which contain 
the a or fi chains of inhibin. in particular, it relates 
to methods for obtaining and using DMA which 
encodes inhtbin, and for maidng inhibm variants 
that depart from the amino acid sequence of natu- 
ral animal or human inhibins and the naturally- 
occurring alleles thereof. 

Inhibin is a protein produced in the gonad 
which acts specificaJly at the prturtary level to in- 
hibit the secretion of follicle-stimulating hormone - 
(FSH). The existence of inhibin was first postulated 
by McCullagh in 1932 ("Sdence" 7g: 19-20). Such 
preferenttat regulation of the gonadotropin secretion 
has generated a great deal of interest and has 
prompted many laboratories in the past fifty years 
to attempt to Isolate and characterize this sut>- 
stance from extracts of testis, spermatozoa, nete 
testis fluid, seminal plasma and ovarian follicular 
fluid, using various bioassays- Although many re- 
ports have appeared in the literature claiming the 
purification of inhibin-Iike material with molecular 
weights ranging from 5,000 to 100,000 daitons, 
subsequent studies have shown that these sub- 
stances were not homogenous, did not have the 
high specific activity expected of true inhibin and/or 
failed to exhibit the motecutar characteristics of 
inhibin as described herein (de Jong, Inhibin-Factor 
Artifact, "Molecular & Cellular Endocrin." 13: 1-10 - 
(1979); Sheth et al^ 1984, •P^.S.*' 165(1) 11-15; 
Seidah ej al., 1984, •F.E.B.S." 175(21:349-355: UFja 
§X al^ March 1985. "F.E.B.S.- ^Sg{'i):^8^'^a4: U si 
al.. June 1985, "Proc NaL Acad. Sci. USA" 
8^:4041-4044: Seidah ai.» "F.E.B.S." 1fi7(1):98- 
102; and Beksac oj al., 1984. "Intern. J. Andrology* 
7:389-397). 

A polypeptide having inhibin activity was puri- 
fied from bovine or ovine follicular fluid (PCT 
86/00078, published January 3, 1986). This protein 
was reported to have a molecular weight of 
56,000±1.000 on SDS-PAGE and was dissociable 
into two subunits having apparent molecular 
weights of 44,000±3,000 and 14,000£2.Q00. Amino 
terminal sequences for each subunit were de- 
scribed. 

Two proteins botfi having a molecular weight of 
about 32,000 daitons and having inhibin activity 
have been successfully isolated from porcine fol- 
licular fluid. Purification of porcine inhibin to sub- 
stantial homogeneity, i.e., about 90% by vreight of 
total protein in the fraction, was achieved through a 



combination of protein separation procedures in- 
cluding heparin-Sepharose affinity chromatography, 
gel filtration and reverse-phase, high-pei^rmance 
Dquid chromatography (RP-HPLC). 
5 These proteins were isolated to substantial ho- 

mogeneity from material obtained from swine and 
are referred to as Protein A and Protein B. Each 
protein has a molecular weight of about 32,000 
daitons {32K) and is composed of two polypeptide 
TO chains having molecular weights of 18,000 and 
1 4,000 daitons, respacth^ly, the chains being 
linked together in the hormonally-active protein by 
disulfide bonding. The amino-terminaJ amino add 
residue sequence of the 18,000 daitons (18K) or 
75 alpha chain of both proteins was determined to be 
Ser-Thr-AIa-Pro-Leu-Pro-Trp-Pro-Trp-Ser-Pro-Aia- 
Ala-Leu-Arg-Leu-Leu-Gln-Arg-Pro-Pro-Glu-Glu-Pro- 
Aia-VaL The amino-terminal amino acid residue 
sequence of the 14.000 dalton (14K) or beta chain 
20 of Protein A was determined to be Gly-Leu-Glu-X- 
Asp-GIy-Lys-Val-Asn-lle-X-X-Lys-Lys-Gln-Phe-Phe- 
Vai-Ser-Phe-Lys-Asp-lie-Gly-Trp-Asn-Asp-Tfp-lte- 
lle-AJa and of Protein B was determined to be Gly- 
Leu-Glu-X-Asp-Gly-Art-Thr-Asn-Leu-X-X-Arg-Gln- 
ss Gln-Phe-Phe-He-Asp-Phe-Arg-Lau. Proteins A and 
B have been completely characterized. Each 32K 
protein exhibits inhibin activity in that It specifically 
inhibits the basal secretion of FSH but does not 
inhibit secretion of luteinizing hormone (LH). The 
30 individual chains were not hormonally-active. 

After the filing of the parent application hereto, 
inhibin B-ch^n dimers were shown to exist in fol- 
licular fluid as naturally-occurring substances, 
termed activin, which are capable of stimulating 
35 FSH release by rat anterior pituitary cells (Vale ^ 
aJ., 1986, "Nature" 321:778-779 and Ung al, 
1986. "Nature" 321:779-782). 

The amino add sequence of the a and /3 
chains of inhibin from humans remained unknown 
40 until the invention heroin. The large quantities of 
human follicular fluid required to parallel the stud- 
ies conducted with animal Inhibins are not readily 
available, nor is there any assurance that human 
and animal inhibins would be suffidentiy similar 
45 that purification using a p>arallel procedure would 
t>e effective. Accordingly, methods are needed for 
determining the characteristics and amino acid se- 
quence for human inhibin. 

Also needed are economical methods for mak- 
50 ing the a and 0 chains of inhibin in large quantities, 
preferably entirely and completely free of proteins 
from the spedes homologous to the inhibin in 
question, which inhibin preferably also is t»ologi- 
cally active. 
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These and other objects will be apparent from 
consideratjon of the inver^tion as a whole. 

SUMMARY 

Nucleic acid now has been isolated and cloned 
in replicabte vectors which encodes the mature 
porcine and human a and fi chains of tnhibin and 
their precursor prepro arKi pro forms. Sequencing 
of inhibin-encoding cDNA has led to the identifica- 
tion of prodomain regions located N-terminal to the 
mature inhibin chains that represent coordinately 
expressed bloiogicaJIy active polypeptides. The 
prddomain regions or prodomain immunogens are 
useful in monitoring preproinhibin processing in 
transformant call culture or in experiments directed 
at modulating the clinical condition or reproductive 
physiology of animais. Thus a or ^ chain nucleic 
acid is used to prepare prodomain sequences from 
the precursor forms of the inhibin chains, to trans- 
form host calls for the recomtnnant expression of 
mature inhibin a and/or 0 chains, and in diagriostic 
assays, tn particular, regions from inhibins a and/or 
chains are expressed in recombinant ceil culture 
by a mett)od comprising ligattng the nucleic add 
encoding the region into a repficable vector under 
the control of a promoter, transforming a host cell 
with the vector, culturing the host cell and re- 
covering the prodomain. activin or inhibin from the 
cultured celt. Inhibin. activin and prodomairrs pro- 
duced by the method of this invention are entirety 
free of homologous source proteins and can be 
produced in biologically active form. 

The nucleic acids identified herein encode the 
a, and chains of porcine or human inhibin. 
Recombinant cells are transformed to express afix 
or afi9 inhibins. or to express 4-chain heterodimers 
or homodimers {which are collectively referred to in 
the literature as activin), ^-chain dimers as pro- 
ducts of recombinant calls expression are free of 
homologous proteins with which they ordinarily are 
associated in nature. 

tnhibin or activin and their nontoxic satts. com* 
bined with a pharmaceutically acceptable carrier to 
form a pharmaceutical composition, are adminis- 
tered to mammals, including humar^s, for control of 
fertility. Administration of inhitMn decreases fertifity 
in female mammals and decreases spermatogen- 
esis in male mammals, and administration of a 
sufficient amount induces infertiirty. tnhit^in is also 
useful in tests to diagnose inrfertility. Activin has 
been shown in the literature to be capable of stimu- 
lating F5H release from pituitary cells and accord- 
ingly is useful as a fertility inducing therapeutic. 



The method of this invention also facilitates the 
convenient preparation of inhibin. activin and pro- 
domain variams having primary amino acid se- 
qu ncas and^or glycosylation differing from the na- 
5 tive analogues, in particular fusions of im- 
munogenic peptides with inhibin, activin or pro- 
domain sequences. 

10 Brief Description of the Drawings 

Rg. 1A is a schematic representation of the 
porcine a-chain mRNA. Overlapping cOf^ clones 
used in the sequence determination are shown 

75 above the diagram of the mRNA structure. Black 
boxes on the 3' ends of X clones indicate that 
these clones were obtained by specific priming. 
Untranslated sequences are represented by a line, 
coding sequerKies are txsxed. The unfilled portion 

20 represents tfie coding region for tfie signal peptide 
and pro-sequences, and the cross-hatched areas 
indicates the 134 amino add o-chain. The scale is 
in nucleotides from the 5' end of the longest cDNA 
done. 

2ff Rg. 16 shows the nudeotide and predicted 

amino add sequence of the pordne cr-chain pre^- 
cursor. Nucleotides are numbered at the left and 
amino adds are numt)ered throughouL The amino 
add sequence underiined was used to design a 
30 long synthetic ONA probe. The 364 amino add 
precursor includes a hydrophobic signal sequence, 
a pro-region, and the mature n-chain (amino adds 
231-364). The proteolytic processing site Arg-Arg - 
(black ttar) immediately precedes the NH2-termh> 
35 nus of the mature alpha chain. Several other pu- 
tative dtt^sic processirvg sites presertt in the pro- 
region are indicated by open bars. The two poterv 
tiai N-iinked gfycosylatk^n sites are shown by the 
cross-hatched bars. The AATAAA box dose to the 
40 3* end of the mRNA is underiiried. 

Fig.2A is a schematic representation of the 
porcine 0AsrKl 0b suburtlt mRNAs with coding se- 
quences tx>xed. The 0a and ^esubunits (dashed) 
are encoded towards the 3* end of the codir^ 
45 sequences. The 3' and 5* untranslated regions are 
shown as a lir^. The length of ttw 5' and 3' 
untranslated region of the 0B subunit mRNA is 
infen^ from the size of the mRNA (Fig. 3) and its 
obvious similarity to ttie mRNA. Tentative ro- 
50 gions of tt>e cONAs are shown as dashed in the 
diagram. The relative positions of the overlapping 
ofigo-dT p>rimed cONA dones and the randomly 
primed dones (XPIN^aSs, \P\N0o1^ and 
XPIN^b2s) are indicated. The scale is in 
55 nucleotides from the 5* end of the 4.5 kb mRNA. 

Fig. 2B is the nucleotide sequence and de- 
duced amino add sequence of the pordne inhit^n 
^-subunit precursors. The Resequence is aligned 
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with the sequence for maximum homology. The 
NHi-terminal of the ^-subunit precursors are in- 
dicated by bracket and arrows. Cysteine residues 
are shaded, possible processing sites are indicated 
by open bars, and a potentiai glycosylation site is 
shown by the cross-hatched box. A very GC-rich 
region present 3* to the termination codon intron 
sequences is underlined and overlinad in both se- 
quences. Amino acid sequences used to design 
oiigonucleottde probes are underlined, as is the 
AATAAA polyadenyfation signal. There was one 
nucleotide difference between \PIN-0a 8 and other 
dones covering this area. A G-to-A change causes 
a change of amino add 278 from a glycine to a 
serine. The proteolytic processing site Arg Arg Arg 
Arg Arg (black bar) immediately precedes the NH» 
terminus of the mature jSASubunit with the pros- 
equences kx:ated upstream. The amino adds for 
the 0A subunit only are numbered. 

Rg. 3 is a Northem blot analysis of porcine 
ovarian mRNA with a. 4a and subunit cDNA 
hybridization probes. Lanes a, b. c, d. and f are 
polyA* mRNA and e and g are total RNA, The 
position of the 2BS and 183 ritxisomaJ RNAs are 
shown. Lanes a, d, and e were hybridized with an 
cr-subunft cDNA probe; lanes d, e and g with a 4a 
subunit specific probe, and tane c with a ^asubunit 
spedfic probe. The a-subunit mRNA ts approxi- 
mately 1.5 kb, the 4* subunit mRNAs are approxi- 
mately 4.5 kb. The hybridizations shown in lanes a, 
b, and c were performed with probes of approxi- 
mately equal length and spedfic activity in order to 
judge relative mRNA levels. 

Rg. 4A is a comparison of the human 4-TGF 
amino add sequence and pordne inhibin fi^ and 
4b amino add sequences. The sequences were 
aligned around the cysteine residues, iderrt'cal resi- 
dues are boxed, while conservative changes are 
designated by an asterisk. 

Rg. 4B compares the <r-subunrt sequence 
with the 4A-inhibin sequence. 

Rg. 5 depicts the construction of a repre- 
sentative recombinant expression ptasmid for por- 
dne inhibin. 

Rg. 6 shows the nucleotide sequence and 
deduced amino add sequence of the human «- 
infiibin cDNA, The 335 amino add pro-or inhibin 
sequence is numbered from the hypothesized sig- 
nal deavage site. Sixteen amino acids of the signal 
sequence are numbered -1 through -16. Homology 
with the pordne sequence predicts a further 12 
amino add residues in the signal sequence. In this 
and other figures, putative dibasic processing sites 
are shown by the open bars, glycosylation sites 
indicated by cross-hatched bars, and amino termi- 
nal mature chain processing sites are depicted as 
black bars. The poly<A) additional signal sequence 
is underlined- Cysteine residues are shaded. 



Rg. 7 is a comparison of the human and 
porcine o-inhibin protein sequences. Spaces are 
introduced to maximize the homology; positions of 
non-identity are indicated by stars. Numbering is 
5 as for the porcine sequence, which is one amino 
add shorter than the human. 

Rg. 8 shows that the nucleotide and de- 
duced amino add sequence of the human 4a if>- 
hibin signal sequence (residues -28 through -1) is 
7a 28 amirw adds with the precursor t>eing 378 amino 
adds in length. The basic processing site is in- 
dicated by a black bar, and a potential glycosyla- 
tion site in the precursor is indicated by a cross- 
hatched bar above the sequence. Cysteine resi- 
75 dues are shaded. 

Rg. 9 illustrates the nucleotide and deduced 
amino add sequence of human 4b inhibin cONA. 
The sequence commences at a cysteine residue - 
(position 7). which lines up with the cysteine 
20 present at residue 7 in the 4a sequence (see Rg. 
8). The processing site for the mature 4b inhitsin is 
shown as a black bar and a potential glycosylation 
site as a cross-hatched bar. Cysteine residues are 
^ shaded. 

25 

DETAILED DESCRIPTION 

The polypeptides of this invention are the a 

30 and 4 chains of inhibin. as well as their muftimer 
forms (activtn and inhibin), their prepro forms and 
their prodomains. togetfier with glycosylation andAx- 
amino acid sequence variants of each chain or 
form thereof. Inhibin (including alleles) from human 

35 or animal sources inhibits the basal release of FSH 
but not of LH from anterior pituitary cells while 
activin does the opposite (hereinafter referred to as 
*hormonally active* activin or inhibin). 

Generally, amino add sequence variants wiD 

40 be substantially homologous with the relevant por- 
tion of the pordne or human a or 4 chain se- 
quences set forth in Rgs. IB. 2B. 6, 8 and 9. 
Substantially homologous means that greater than 
about 70% of the primary amino add sequence of 

45 the candidate polypeptide corresponds to the se- 
quence of the porcine or human chain when 
aligned in order to maximize the numtjer of amino 
add residue matches betwem the two proteins. 
Alignment to maximize matches of residue indudes 

50 shifting the amino and/or cartxaxyl terminus, if>- 
trodudng gaps as required and/or deleting residues 
present as inserts in the candidate. For example, 
see Rgs. 2B and 7 where the 4a and 4b sutnjnits 
or human and p)orcine o-tnhibin sequences are 

55 aligned for maximum homology. Typically, amino 
acid sequences variants will be greater than about 
90% homotogous with the corresponding riative 
sequences shown in Rgs. 1 B. 2B. 6. 8 and 9. 
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Variants that are not hormonally-acfve fall with- 
in the scope of this invention, and inciud© potypep- 
tides that may or may not be substantially homolo- 
gous with either a mature inhibin chain or pro- 
domain sequence, but which are 1 ) immunological- 
ly cross-reactive with antibodies raised against the 
native counterpart or 2) capable of competing with 
such native counterpart polypeptides for cell sur- 
face receptor bir>ding. Hormonally inactive variants 
are produced by the recombinant or organic syn- 
thetic preparation of fragments, in particular the 
isolated 0 chains of inhibin, or by introducing ami- 
no add sequence variations so that the molecules 
no longer demonstrate homional activity as defined 
above. 

Immunologicai or receptor cross-reactivity 
means that the candidate poiypeptide is capable of 
competitively inhibiting the binding of the 
homnonally-active analogue to polyclonal antisera 
raised against the honmonafly^-active analogue. 
Such antisera are prepared in conventional fashion 
by injecting goats or rabbits S.C. with the 
hormonalty-active analogue or derivative in com- 
plete Freunds adjuvant* foltowed by t)ooster in- 
traperitoneal or S.C. injections in incomplete 
Freunds. 

Variants that are not hormonally active but 
which are capable of cross-reacting with antisera to 
hormonaily-active inhibin. activin, or prodomains 
are useful (a) as reagents in diagnostic assays for 
the native analogues or their antibodies, <b} when 
insolubilized in accord with known methods, as an 
agent for purifying anti-native analogue antibodies 
from antisera. and (c) as an immunogen for raising 
antibodies to honmonally-active analogues. 

This invention includes the pro and/or prepro 
sequences of the inhibin a or ff chain precursors. 
or their immunologically or btologicaily active frag- 
ments, substantially free of the conresponding ma- 
ture inhitHn chains. These sequences for porcine 
and human inhibin are shown in Bgs. IB, 2B, 6, 8 
arid 9. The prepro sequence for the porcine o 
subunit precursor is ttte polypeptide comprised by 
residues 1 to about 230. while the >3A5ubunit pro 
sequence is comprised by residues 1 to about 308. 
These sequences shall be refenred to herein as 
encompassing prodomain sequences. 

The a and sutKinit prodomain sequences are 
comprised of several domains bounded by prot- 
eotysts sites, any one of wNch is synthesized here- 
in separately or in combination with other domains. 
The principal porcine domains fall within resi- 
dues 1 to about 70 (domain 1). atXMJt 70 to atiout 
110 (domain II). about 110 to about 180 (domain 
III), about 180 to about 260 (domain IV}, and about 
270 to about 309 (domain V). In particular, the 
porcine 3*. domains are GHSAAPDCPSCALATLPK- 
DVPNSQPEMVEAV. HILNMLHLKKRPDVTQPVP- 



KAALLNAl. LHVGKVGENGYVELEDDIG. AEMNEL- 
MEQTSEIITFAEAGRARKTLRFEISKEGSDLSWER- 
AEAVLFKVPKANRTRTKV StRLFQQQ. 
PQGSADAGEEAEDVGFPEEKSEVUSEKWDA. 
5 STWHiFPVSSSlQRLLDQGKSALDIRTACEQCHET- 
GASLVLLG. and GHSAAPDCPSCALATLPKOVPN- 
SQPEMVEAVkKHILNMLHLKKflPDVTQPVPKAALL- 
NAi. The porcine fis domains comprise RAAH1LL- 
HAVRVSGWLNL as well as homologous B donnains 
10 having the same sequences. The porcine or do- 
mains comprise GPEUDRELVIAKVRALFLDALGP- 
PAVTGEGGDPGV and GSEPEEEDVSQAfLFPAT- 
GARCGAEPAAG£LAREAEEGU=TYVGRPSQHTHS- 
RQVTSAQtWFHTGL DRQGMAAANSSGPIXDL- 
75 LALSSRGPVAVPMSLGQAPPRWAVLHLAASALPL- 
LTHPVLVLLLRCPLC SCSARPEATPFLVAHTRARP- 
PSGGERA. A typical combination domain polypep- 
tide would t>e 4a domain II linked at its C-terminus 
to the NHi-termtnus of 0a domain 111. in addition. 
20 these domains are fused together by the prot- 
eolysis sites found in the sequences shown in Figs. 
1 B or 2B. by 1 to 4 residue polypeptides that are 
resistant to hydrolysis (for example, gtutaminyt or 
histidyl residues), or are directly fused, whereby, in 
25 all three instances, combination domain polypep- 
tides are produced. 

PrincipaJ human o chain prodomains are ap- 
proximately residues 30-199 and 1 to 29, human 
ffA prodomains are approximately residues 1-30, 
30 32-40. 43-59. 62-80. 83-185 ar>d 18&-230 white 
human fi^ prodomains are approximately residues 
1-13, 15-30. 32-59, 62-145. 148-195 and 198-241 - 
(referring to the nixmboring system adopted in Figs. 
6. 6 and 9. respectively). Combination prodomain 
35 polypeptides are within tt)e scope hereof, for exam- 
ple, the ^AProdomain at about 43-80. ar>d fin ixo- 
domains at about 1-30 and about 32-145. The 
preferred human a. 0^ and 0b chain prodomains 
are about residues 1-29. about 43-80 and afc>out 1- 
40 30. respectively. 

The intact isolated Qrepro or prodomain 4a. 09 
or a sequences are best synthesized in recom- 
binant cell culture. The individual sutx^omponent 
domains are synthesized by routine mett>ods of 
45 organic chemistry or by recwnbinant cell culture. 
They then are labelled with a radioisotope or other 
detectable group such as an enzyme or 
fluorophore in accord with known methods and 
used in standard competitive immunoassays to de- 
50 tect the levels of prepro or pro forms of inhibin. 
including individual domains, in transfonmants with 
DNA encoding such forms or their precursors. This 
assay is useful in determining whether proteolytic 
hydroiysis of proinhibin is occurring in the host 
55 transformants or their culture media. The assay 
also is useful in determining whether a rate limiting 
step in recombinant synthesis is translation of 
mRNA into the prepro forms or processing of the 
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prepro forms into mature mhibin. For example, high 
levels of prepro or pro inhibin in call lysates. but 
relatively low levels of secreted mature inhibin. 
would suggest that the host cell is adequately 
transcribing and translating the inhibin DNA. but is s 
not processing the precursors at an adequate rate. 
Thus, in this case one would select an alternate 
host ceil rather than concentrating on improving the 
transcription or translation efficiency of the trans- 
forming plasmid. e.g.. by selecting an attemate ia 
promoter. The prodomain sequences also are be- 
lieved to be involved in coordinate modulation of 
animal physiology in reproductfve cycles and fertil- 
ity. 

Amino acid sequence variants are any one of is 
1) hormonally-active, 2) cross reactive with anti- 
bodies raised against mature inhibin or prodomain 
a or ^ chain sequences, or 3) cross-reactive with 
inhibin/activin cell surface receptors, but are char- 
acterized by a primary amino acid sequence that 20 
departs from the sequence of riaturaJ inhibins or 
prodomain sequences. These derivatives ordinarily 
are prepared by introducing insertions, deletions or 
substitutions of nucleotides into the DNA encoding 
the target DNA to be modified in order to encode 2s 
the variant and thereafter expressing the DNA in 
recomt>inant ceil culture. Polypeptides having up to 
about 100-150 residues also are conveniently pre- 
pared by invitro synthesis. Such variants are char- 
acterized by the predetermined nature of the van- 30 
ation, a feature that sets them apart from naturally 
occurring allelic or interspecies variation. The vari- 
ams may exhibit the same qualitative tHotogical 
activity as the naturally-occurring analogue or may 
act arrtagonisticBlly towards such analogues. as 

White the site for irttroducing a sequence vari- 
ation is pfedetarmir)ed, rt is unr>ecessary that the 
mutation Q^r se be prBdetermined. For example, in 
order to optimize the performance of mutation at a 
given- site, random mutagenesis may be conducted 4o 
at the target codon or region ar>d tt)e expressed 
inhibin mutants screened for the optimal combinar 
tion of desired activity. Techniques for maidng sub- 
stitution mutations at predetermined sites in DNA 
having a known sequence is welt known, for exam- 45 
pie Ml 3 primer mutagenesis. 

Mutagenesis is conducted by making amino 
acid insertions, usually on the order of about from 

1 to 10 amino add residues, or deletions of about 
from 1 to 30 residues. Deletions or irtsertions pref- so 
erably are made in adjacent pairs. I.e. a deletion of 

2 residues or insertion of 2 residues. Substitutions, 
deletions, insertions or any subcombination may be 
combined to arrive at a final construct. Insertions 
include amino or cartxDxyl-terminal fusions, e.g. a 55 
hydrophobic extension added to the carboxyt termi- 
nus. Preferably, however, only substitution 
mutagenesis is conducted. Obviously, the muta- 



tions in the encoding DNA must not place the 
sequence out of reading frame and preferably will 
not create complementary regions that could pro- 
duce secondary mRNA structure. 

Not all mutations in the DNA which encode the 
polypeptides herein will be expressed in the final 
secreted product For example, a major class of 
DNA substitution mutations are those in which a 
different secretory leader or signal has been substi- 
tuted for the native porcine or human a or 0 chain 
secretory leader, either by deletions within the 
leader sequence or by substitutions, wherein most 
or all of the native leader is exchanged for a leader 
more likely to be recognized by the intended host 
For example, in constructing a procaryotic expres- 
sion vector the porcine or human a or 0 chain 
secretory leader is deleted in favor of the bacterial 
alkaline phosphatase or heat stable enterotoxin II 
leaders, and for yeast the leader is substituted in 
favor of the yeast invertase, alpha factor or add 
phosphatase leaders. However, the porcine and 
human secretory leaders are recognized by many 
heterologous higher eukaryotic cells. When the se- 
cretory leader is "recognized" by the host the host 
signal peptidase is capable of cleaving a fusion of 
the leader polypeptide fused at its C-tenminus to 
the mature inhibin or prodomain such that mature 
inhibin or prodomain polypeptide is secreted. 

Another major class of DNA mutants that are 
not expressed in final form as amino add se- 
quence variations are nucleotide substitutions 
made in the Df^ to enhance expression, primarily 
to avoid 5* stem and loop structures in the tran- 
scribed mHNA (see de Boer §1., EP 75,444A) or 
to provide codons that are more readily transcribed 
by the selected host e.g. the weit-krxown pref- 
erence codons for £. coli or yeast expression. 
These substitutions may or may not encode substi- 
tuted amino acid residues, but preferably do not 

Insertionat and deletional amino add seqtience 
variants are proteins in which one or more amino 
acid residues are introduced into or removed from 
a predetermined site in the target inhitxn, activtn, 
prodomain or proform of inhibin or activrn. Most 
commonly, insertionat variants are fusions of het- 
erologous proteins or polypeptides to the amino or 
cartwxyl terminus of the a or 0 chains, the pro- 
domains or other inhibin derivatives. Immurw^genic 
derivatives are made by fusing an immurK)genic 
polypeptide to the target sequence, e.g. a pro- 
domain polypeptide, by synthesis in vftro or by 
recombinant cell culture ti-ansfbrmed with DNA en- 
coding the fusion. Such immunogenic polypeptides 
preferably are bacterial polypeptides such as 
trpLE. beta-galactosidase and the like, together 
with their immunogerwc fragmerrts. Other insertions 
entail inserting heterologous eukaryotic (e.g. the 
herpes virus gD signal) or microbial secretion sig- 
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nal or protease processing sequences upstream 
from the NHj-terminus of the protein to be sd- 
creted- Deletions of cysteine or other labile resi- 
dues also may be desirable, for example in in- 
creasing the oxtdativ© stability of the a or 4 chain. 
Oeleiion derivatives wilt produce o or ^ chain frag- 
ments. Such fragments, when biologically or im- 
munologically active, are within the scope herein. 
For Instance, a fragment comprising fi^ or 
^A^esidues about from 11 to 45 (numt>ered from 
mature Glyt) is to be included within the scope 
herein. 

Immunogenic conjugates of prodomain poly- 
peptides, inhibin and activin are readily synthe- 
sized in recombinant call cutture as fusions with 
immunogenic polypeptides, e.g. t>eta-lactamasd or 
viral antigens such as herpes gD protein, or by 
preparation of the polypeptides in unfused form (by 
recombinant or in vitro synthetic methods) followed 
by covaient cross-linking to an immunogenic poly- 
peptide such as keyhole limpet hemocyanin or STt 
using a divalent cross-Onkirig agent The im- 
munogenic p>olypeptide5 are formulated with a vac- 
cine adjuvant, e.g. alum or Freunds. Methods for 
preparing proteins in adjuvants and for cross-link- 
ing are well-known -per se and would be employed 
by one skilled in the art as are metf>ods for vacci- 
natir>g animals. The immunogenic conjugates are 
usetui in preparing antibodies to the prodomain 
region for use in monitorir^g inhibin manufacture or 
for in vivo vaccination with the objectiv© of raising 
antibodies capable of modulating antmat physiol- 
ogy in reproductive cycles and fertility. Typica/ly, 
the prodomain or its immunogen is administered in 
varied doses to fertile laboratory animals or swine 
and the reproductive cycles and fertility of the 
animals monitored, togettw with assays of serum 
levels Off anti-immunogen or prodomain by routine 
competitive or sandwich immunoassay. 

Sut)stitution derivatives are produced by mutat- 
ing the ON A in a target codon. so that thereafter a 
different amino acid is encoded by the codon, with 
no concomitant change in the numtser of residues 
present in the molecule expressed from the mu- 
tated DNA. Substitutions or deletions are useful for 
example In increasing the stability of the proteins 
herein by eliminating proteolysis sites, wherein re- 
sidues are substituted within or adjacent to the 
sites or are deleted from the sites, or by Introduc- 
ing additional disulfide bonds through the substitu- 
tion of cysteine for other residues. Substitutions are 
useful for facititating the syntfiesis or recovery of 
mature or prodomain a or 0 chains. For example, 
methionine residues within the mature inhibin se- 
quences are substituted or deleted, prepro se- 
quer>ces deleted, methionine inserted at the -1 site 
immedtateiy NHi terminal to the mature NH, termi- 
nai residue and another sequence inserted N-termi- 



nal to the exogefKHJS methionine. The inhibin de- 
rivative in this case is expressed as a fusion having 
an intermediate methtonyl residue, which in turn is 
cleaved at this residue by cyanogen bromide in 
5 accordance with known practice. The mature in- 
hibin derivative released from the fusion is recov- 
ered. 

Exemplary pcrdne inhilwn derivatives are - 
[Asnwi->Gln]inha (to remove the putative 

w glycosylation site). [Cysm or Cys9«->A]lnha, (Cysmi 
or Cysxr>AJlnha, [Lys», or Lysan-> AJlnh^* or - 
CLys«->His or Serpnh/SA (to inactivate a poterrtial 
proteolysis site). [Ly3,«->Arg; VaUrf->Thr] Inh^A (to 
create a >Sa/4b hybrid), [CySw or CySsM-> Ajtnh^A. 

75 [Lys«M->GlnJInh4A, fArga«->Lys. Val„4->Thr]lnh^a - 
(to create a fia/^A hyt>rid). [Cys,« or Cys»i-> A]- 
Inh^a P''o»,GiyjB-> Pro Phe GlyJInhjSg, and {Arg,.- 
>Gtn]lnh^a, wherein Inh is sn abbreviation for ir>- 
hit>in and the re^due members for Inh^Seare those 

20 used for the corresponding Inh^A residue (see Bg. 
2B). 

The InhiSA amino acid positior^ which are prir>- 
cipal candidates for mutational substitution or dele* 
tion (or adjacent to whk:h residues may be in- 
25 serted) include residues 293-297. 364-376 and 
387-398 (Fig. 8). Preferably, the proline, cysteine 
arxj glycine residues within ttwse sequences are 
not modified. Candidates having greater potency 
than inhibin or activin. or which ser/e as inhibin .or 

3o activin antagonists, are identified by a screening 
assay wf)ere(n the candidate is diluted into solu- 
tions containing constant amounts of inhitxn or 
activin and ttie compositiorw assayed in the rat 
pituitary cell assay. Candidates which ni^9c an- 

35 tagonize or agonize inhibin or activin are screened 
for utility in immunoassays for inhibin or activin by 
measuring competitive immunodtsptacement of 
latMlled inhibin or activin of the native hormones 
from polyclonal antibody directed against the native 

40 hormones. Exemplary contemplated sequence vari- 
ams of Inh^A include Phe3«->lle or Leu; Gtn^ 
>Asp or Lys; Trpw-> Tyr or Phe; Trp,«->Tyr or 
Phe; ne,M->Pt>e or Val; Tyr3,^>Trp or Thr; His,d- 
>Lys; Aia,„->Ser; Asn»->Gln. Tyr or His; Tyra,-> 

45 Thr or Asp, Phe„.->Tyr (a TGF-^/j3a irrtrachain 
hybrid); His»->Asp; His«r>Lys (a ^a/^b hytxid); 
Phe3«->Tyr Val»,->Phe; Val»»->Leu; Tyr»,->Thr; 
Tyr»,->Trp; Asn«.->61n, His or tys; lle3..->Leu or 
Thr M8t>« ->Leu or Ser; Vai«,->Phe, Glu, Thr or 

50 lie. Comparable modifications are made in the hu- 
man ^0 chain. For example. Inh^A contains a 
phenylaiany! residue at position 302, and Inh^a 
also contains a phenylalanyl residue at a homoto- 
gous position (254, Fig. 9) when aligned in tf>e 

55 same fashion as is shown for porcine fi^ in Fig. 4A. 
Thus, since the Phe^^ residue of j3a is described 
above as substituted by isoleucinyt or leucinyt. the 
Phe^* of is substituted with the same residues. 
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A factor in establishing the identity of a poly- 
peptide as inhibin. activf'n or an inhibin variant is 
the ability of antisera which ar capable of substan- 
tially neutralizing the hormonal activrty of mature 
inhibin or activin to also substantially neutralize the s 
hormonal activity of the polypeptide in question. 
However rt will be recognized that immunological 
identity and hormonal activity are not necessarily 
coextensive. For exampte, a neutralizing antibody 
for inhibin may not bind a candidate protein t)e- ro 
cause the neutralizing antibody happens to not t>e 
directed to spedficaliy bind a site on inhibin that is 
critical to its activrty. Instead, the antibody may 
bind an innocuous region and exert its neutrafizing 
effect by steric hinderance. Therefore a candidate 75 
protein mutated in ttiis innocuous region migtrt no 
longer bind the neutralizing antibody, but it would 
nonetheless fc>e inhibin in temis of substantial ho- 
mology and biological activity. 

It is important to observe that characteristics 20 
such as molecular weight, tsoelectrtc point and the 
like for a native or wild type mature inhibin or 
activin obtained from follicular fluid or other tissue 
sources are descriptive only for the native form. 
Variants contemplated by the foregoing definition 25 
will include other polypeptides which will not ex- 
hibit all of the characteristics of native analogue. 
For exan^le, inhibin derivatives fike the insertion 
mutants, deletion mutants, or fusion proteins de- 
scribed above will bring inhibin outside of ttte mo- 30 
lecular weight established for . Vn& corresponding 
native inhibin because fusion proteins with mature 
inhibin or proinhibin itself as well as insertion mut- 
ants will have a greater molecular weight ttian 
native, mature inhibin. On the other hand deletion 35 
mutants of native, mature inhii>in will have a lower 
molecular weight Rnalfy. post-translationai pro- 
cessing of preproinhibin chains in heterologous ceil 
fines may not be accomplished with the tideGty 
exercised hy the homologous host celt, thereby 40 
resulting in some variation in the amino temiini of 
the a and/or 0 chains. This variation may be en- 
countered as residual prosequence remaining with 
the mature protein, or the loss of several nnature 
residues that are cleaved off with the prosequence. 45 
The same is true with processing of the preprotein 
in heterologous recombinant cells. 

Covalent modifications of inhibin. activin or pro- 
domains are included within the scope hereof and 
include covalent or aggregative conjugates with so 
other chemical moieties. Covalent derivatives are 
prepared by Bnkage of fur>ctionalities to groups 
which are found in the inhibin amino add side 
chains or at the ^4-or C^termini. by means known in 
the arL For example, these derivatives wilt include: 55 
aUphatic esters or amides of the cartxsxyl terminus 
or residues containing carboxyl side chains, e.g,, 
aspartyl residues; O-acy! derivatives of hydroxyl 



group-containing residues such as seryl or alanyt; 
and N-acyi derivatives of the amino terminal amino 
add or amino-group containirrg residues, e.g. 
lysine or arginine. The acyl group is selected from 
the group of alkyl-moieties (induding C3 to CIO 
normal alkyi). thereby forming aikanoyi species, 
and carbocyctic or heterocyciic compounds, there- 
by forming aroyi spedes. The reactive groups pref- 
erably are difunctional compour^ known 2gr ^ 
for use in cross-linking proteins to insoluble ma- 
trices through reactive side groups, e.g, m- 
Maieimidobenzoyl-N-hydroxy sucdnimide ester. 
Preferred derivatization sites are at histidrne resi- 
dues. 

Covalent or aggregatwe derivatives of mature 
inhibin. activin or prodomain sequences are useful 
as reagents in immunoassay or for affinity purifica- 
tion procedures. For example, inhibin or prodomain 
is insolubilized by covalent bonding to cyanogen 
bromide-activated Sepharose by methods knovvn 
Cgr 2g or adsortaed to polyotefin surfaces (with or 
without glutaraldehyde cross-linking) for use in the 
assay or purification of anti-inhibin or anti-pro- 
domain antibodies or cell surface receptors. Inhibin 
or a prodomain sequence also ts labelled with a 
detectable group, e.g.. radioiodinated by the chlo- 
ramine T procedure, covalentiy bound to rare earth 
ct>elates or conjugated to anottier fluorescent moi- 
ety for use in diagnostic assays, espedalty for 
diagnosis of inhibin or prodomain levels in biologi- 
cal samples by competitive-type immunoassays. 

ONA which encodes the complete ce and 0 
chains of inhibin/activin is obtained by chemical 
synthesis, by screening reverse transcripts of 
mRNA from ovary, or by screening genomic librar- 
ies from any cell. It may be more effident to 
simply synthesize portions of the DNA desired 
since screening is required to identify DNA to 
cONA or genomic libraries that encode the a and 
chains. Synthesis also is advantageous t>ecause 
unique restriction sites can be introduced at the 
time of preparing tfie DNA, thereby fadRtating the 
use of the gene in vectors containing restriction 
sites otherwise not present in the native sequence, 
and steps can be taken to enhance translaticnal 
efficiency as discussed above, without the need to 
further modify the DNA as by mutagenesis or the 
like. cDNA encoding the o or j8 chains is free of 
untranslated Intervening sequences (introns) as well 
as free of flanking DNA encoding other proteins 
homologous to their source. 

DNA encoding the a and 0 chains is obtained 
from other sources than pordne and human by (a) 
obtaining a cDNA library from tiie ovary of the 
target animal, (b) conducting Southern analysis 
with labelled DNA encoding pordne or human a 
and 0 chains or fragments thereof (generally* great- 
er than 100 bp) in order to detect clones in the 
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cDNA library that contain homotogous sequences. - 
(c) anaiy2ing the clones by restricting enzyme ana- 
lysts and nucleic acid sequencing so as to identify 
fulHengtfi clones and, if full length clones are not 
present in the library, recovering appropriate frag- 
ments from the various clones and ligating them at 
restriction sites connmon to the clones to assemble 
a clone encoding the fuiWength molecule. As 
shown infra, any sequences mi'sstng from the R- 
brary can be obtained by the 3' extension on 
ovarian mRNA of synthetic otigodeoxynucleotides 
complementary to cDNA identified by screening 
the library, or homologous sequences are suppCed 
from known animaJ cDNAs. This is particutarty use- 
ful in constructing pre or prepro inhibin sequences 
to facilitate processing of preproinhibin to mature 
tnhibin from the desired species. 

Porcine and human ovarian cONA libraries ini- 
ttally were probed for DNA encoding inhibin se- 
quences using latjelled ofigonucleotides whose se- 
quence was based on the partial amino add se- 
quence determined from analysis of purified por- 
cine inhibin or, in the case of human cOMA, por- 
cine cDNA probes. However, once having de- 
scribed cONA encoding human and pordne inhibin 
and prodomains, one skilled in the art would realize 
that precisely hybridizing probes can be prepared 
from the descritDed sequences in order to readily 
obtain the remainder of the desired human or por- 
dne gene. 

Nucleotide sequence analyses of identified por- 
dne and human cOF^A clones revealed the struc- 
tures of the biosynthetic precursors of t»th forms 
of jnhtt»n. Interestingly, the two inhibin chains are 
not derived from a single processed precursor. 
Instead, the two chains are translated from sepa- 
rate mRNAs and then assembled into the disulfide 
crossttnked two-chain molecule. 

Figs. 1B and 2B and 6, 6 and 9 depict the ONA 
encoding the polypeptide chains constituting por- 
cine and human preproinhibin and preproactivin. 
Obviously, degenerate codons may be mjbstituted 
for those disdosed in these figures where tfie 
same amino acid is encoded. The DNA of Figs. IB. 
26^ 6. 8 and 9 is mutated in order to encode the 
amino acid variants of the a and 0 chains de- 
scrit^ed above. In particular, the prepro sequences 
are deleted and a start codon inserted immediately 
5* to the mature cf^ain in question so that the chain 
is expressed directly in recombinarrt culture. The 
DNA also is labelled, e.g. with radioactive phos- 
phorous, and used to screen ovarian cONA libraries 
from other spedes to identify cx or ;9 cfiain encod- 
ing DNA from such otiier spedes as is generally 
described above. 



Coval nt labelling of this DNA is accomplished 
with a detectabi substance such as fluorescent 
group, a radioactive atom or a chem/iumtr>escent 
group by methods known ggr ig. The lat?elied DNA 
5 is then used in conventional hybridization assays. 
Such assays are employed in identifying vectors 
and transformants as described in examples infra. 
or for in vttro diagnosis such as' detection of 
mRNA in tissues. 
70 Lengthy sequences desirably are synthesized 

in host cells transformed with vectors containing 
DNA encoding them, e.g. inhibin or prodomain 
sequence. Vectors are used to amplify the DNA 
which encodes the chains, either in order to pre- 
IS pare quantities of DNA for ' further processir^g 
(doning vectors) or for expression of the chains - 
(expression vectors). An expression vector is a 
repficable ONA construct in which a DNA sequerxa» 
encoding an a or ^ chain is operabiy Gnked to 
20 suitable control sequences capable of effecting 
tiieir expression in a suitable host Cloning vectors 
need not comain expression corrtrol sequences. 
Such control sequences include a transcriptional 
promoter, an optional operator sequence to control 
ss transcription, a sequence encoding suitabie mRNA 
ribosomal tending sites (for prokaryotic expres- 
sion), and sequences whic:h control termination of 
transcription and translation. The vector shoukj in- 
dude a selection gene to fadlitate the stable ex- 
30 pression of the desired polypeptide and/or to iden- 
tify transformarrts. However, the selection gene for 
maintaining a and/or chain expression can be 
supplied by a separate vector in cotransfbrmation 
systems using eukaryotic host ceils. 
3S Vectors comprise ptasmids, viruses (induding 

phage), and integratable ONA fragments i.e.« frag- 
ments that are integratabie into the host genome 
by recomt)mation. The vectors descrit>ed herein for 
use in eukaryotic cell expression of inhibin o and/or 
40 0 chains contain plasmid sequences for dor^ng in 
microtses. wtiere the plasmid replicates autono- 
mousiy from the host genome, but the DNA is 
believed to integrate into the eukaryotic host cell 
genofr>e upon transformation. Similariy, badtkis 
45 vectors that genomicalty integrate by homologous 
recombination in t>adHu5 also are useful. However, 
all other forms of vectors which serve an equivalent 
function and whidi are, or t>ecome, known in the 
art are suitable for use herein, 
so Suitable vectors generally will contain repJicon 

(origins of repfication, for use in norvintegrative 
vectors) and control sequences which are derived 
from spedes compatible with the intended expres- 
sion host Transformed host calls are cells which 
55 have been transformed or transfected with vectors 
containing inhibin a and/or 0 chain encoding DNA. 
Transformed host cells contain doned Df^ and, 
when transformed with an expression vector, also 
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express the a and/or chains. The expressed 
polypeptides will be deposited intracenulariy or se- 
creted into either the periplasmic space or the 
culture supernatant, depending upon the host ceil 
selected and the presence of suitable procsssing 
signals in the expressed protein, e.g. homologous 
or heterologous signal sequences. 

DNA regions are operably Onked when they are 
functionally related to each other. For example, 
ONA for a presequence or secretory leader is op- 
erably Bnked to DNA for a polypeptide if it is 
expressed as a preprotein which participates in the 
secretion of the polypeptide: a promoter is op- 
erably finked to a coding sequence if it controls the 
transcription of the sequence; or a nlxisome bind- 
ing site is operabty linked to a coding sequence if 
It is positioned so as to permit transiation. Gen- 
erally, operably linked means that the DNA se- 
quences being linked are contiguous and, in the 
case of secretory leaders, contiguous and in read- 
ing phase. 

Suitable host cells are prokaryotes, yeast or 
higher eukaryotic ceils, Prokaryotes include gram 
negative or gram positive orgar^sms, for example 
colF or Bacilli. Higher eukaryotic cells indude 
established call lines of mammalian origin as de- 
scrit>ed below. A preferred host cell is ooli 294 - 
(ATCC 31 .446) although other prokaryotes such as 
E. C2ii B. soli X1776 (ATCC 31.537), est 
VV3110 (ATCC 27.325). pseudomonas species, or 
Serratia Marcesans are suitable. 

Expression vectors for host -cells ordinarily in- 
clude an origin of replication (where extraciv 
romosomal amplification is desired, as in cloning, 
the origin will be a bacterial origin), a promoter 
located upstream from the inhibin coding se- 
quences, together with a ribosome binding side - 
(the ribosome binding or Sh'me-Daigamo sequence 
is only needed for prokaryotic expression), RNA 
splice site (rf the inhibin DNA contains genomic 
DNA containing one or more introns), a 
polyadenylation site, and a transcriptional termir^a- 
tion sequence. As noted, the skilled artisan will 
appreciate that certain of these sequences are not 
required for expression in certain hosts. An expres- 
sion vector for use with microbes need only contain 
an origin of replication recognized by the intended 
host, a promoter which will function in the host and 
a phenotypic selection gene, for example, a gene 
encoding proteins conferring antibiotic resistance 
or supplying an auxotrophic requirement Inhibin 
DNA is typically cloned in ^ coli using pBR322. a 
plasmid derived from an gj. spedes (Bolivar, 
ai-, 1977 "Gene" g: 95. pBR322 contains genes for 
ampicillin and tetracycline resistance and thus pro- 
vides easy means for identifying transformed cells. 



Expression vectors, unlike cloning vectors, 
must contain a promoter which is recognized by 
the host organism. This is generally a promoter 
homologous to the intended host Promoters most 

s commonly used in recombinant DNA construction 
include tiie ;5-lactamase (peniallinase) and lactose 
promoter systems (Chang gt a!-. 1978, "Nature", 
275; 615: and Goeddel et ai., 1979. "Nature* 281 : 
544), a tryptophan (trp) promoter system (Goeddel 

TO et a(., 1980. 'Nucleic Acids Res." 4057 and EPO 
Appl. PutJl. No. 36,776) and the tac promoter [H. 
De Boer gi al „ 1983. "Proa Nafl. Acad. Sd. 
U.SJ\." 21-25]. While these are the most com- 
monly used, other known microbial promoters are 

15 suitable. Details concerning their nucleotide se- 
quences have been published, enabling a skilled 
worker operably to ligate them to DNA encoding 
inhibin in plasmid vectors (Siebenlist s| gl., 1980, 
"Cell" 22: 269) and the DNA encoding inWbin or its 

20 derivative. Promoters for use in prokaryotic expres- 
sion systems also will contain a Shine-Oalgamo - 
(S.D.) sequence operably linked to the DNA encod- 
ing the inhibin, i.e., the S.D. sequence is positioned 
so as to facifitate tvBnslatlon. (3er>erally. this means 

25 that the promoter and SJD, sequences kxrated up- 
stream from the second codon of a bacterial struc- 
tural gene are substituted for the sequerxies of 
prepro inhibin located 5* to the mature a and/or 0 
chains. 

00 In addition to prokaryotes, eukaryotic microbes 

such as yeast cultures are trarisfbrmed with inhibin- 

- - encoding vectors. Saccharomvces cerevisiae. or 
common baker's yeast is the most commonly 
used among lower eukaryotic host microorganisms. 

35 However, a number of other strains are commonly 
available and useful herein. Yeast vectors generally 
will contain an origin of replk:ation from the 2 
micron yeast plasmid or an autonomously replicat- 
ing sequefjce (ARS), a promoter, DNA encodir>g 

^ the cc and/or S chain, sequences for polyadenyla- 
tion and transcription termination, and a selection 
gene. A suitable plasmid for expression in yeast is 
YRp7. (Stinchcomb gj ai» 1979. "Nature", ^gg: 3S: 
Kingsman §1 ^ . . 1979, "Gene", 7: 141; T- 

45 schemper al., 1980, "Gene". 10:157). This plas- 
mid already contains the trgi gene which provides 
a selection marker for a mutant strain of yeast 
lacking the at>ility to grow in tryptophan, for exam- 
ple ATCC No. 44076 or PEP4-1 (Jones, 1977. 

so "Genetics", fig: 12). The presence of the trpi lesion 
in the yeast host cell gerujme than provides an 
effective environment for detecting transformation 
by growth in the absence of tryptophan. 

Suitable promoting sequences in yeast vectors 

ss include the promoters for metallothionein, 3- 
phosphoglycerate kinase (Hitseman gi ^1^980, "J. 
Biol. Chem.", 255 : 2073) or oftier glycolytic en- 
zymes (Hess ej al., 1968, "J. Adv. Enzyme Reg.", 
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7: 149: and HoHand a!., 1978, "Siochemist^y'^ 
17, 4900). sucft as enolase. g/yceraid6nyde-3-phos- 
phat© dehydrogenase, hexokinase, pyruvate decar- 
boxylate, phosphofructokinase, glucose-6-phQs- 
phate tsomerase, 3-phosphogtycerate mutase, 
pyruvate kinase, triosephosphate isomerase. 
phosphoglucQse isomerase. and glucokinase. Suit- 
able vectors and promoters for use in yeast ex- 
pression ar further described in R. Hltzeman et aL. 
EP 73,657A, 

Other yeast promoters, which have the addi- 
tionaf advantage of transcription corttrolted by 
growth conditions, are the promoter regions for 
aicohoJ dehydrogenase 2, isocytochronrt© C. acid 
phosphatase, degradative enzymes associated with 
nitrogen metabolism, and the aforementioned 
metailothionetn and glyceraidehyde-3-phosphate 
dehydrogenase, as well as enzymes responsible 
for mattose and galactose utiiization. \n construct- 
ing suitable expression pfasmids, the termination 
sequences associated with these genes are aiso 
ligated *mto the expression vector 3* of the inhibin 
or derivative coding sequences to provide termina* 
lion and poiyadenyiation of the mRf^ 

Cuftures of ceils derived from multiceltular or- 
ganisms are the preferred host cells herein be- 
cause it is t^etieved that expression of hormonaily 
active inhit)in or acthrtn will only occur in such 
cells, with microbial expression resuttir>g at most 
only in immunological cross-reactrvity. In princtpie, 
any higher eukaryotic cell culture is workable, 
whether from vertebrate or invertebrate cutture. 
Propagation of vertebrate cells in culture Q§r s§^ 
has become a routine procedure in recent years - 
(Tissue Culture . Academic Press. Kruse and Patter- 
son, editors (1973)], 

Suitable host cells for expressing a or ^ chains 
in higher eukaryotes ir)clude: monkey kidney CVI 
line transformed by SV40 (COS-7. ATCC CRL 
16S1); baby hamster kidney calls (BHK, ATCC CRL 
10): Chinese hamster ovary-calls-DHFR (described 
by Uriaub and Chasin. PNAS (USA) 77: 421 a, - 
neeO]); mouse Sertoli cells {TM4. Mather. J.P.. 
SUA. Reprod- 23 : 243-251 [1980]); monkey kidney 
cells (CVI ATCC CCt 70): african green monkey 
kidney ceils (VERO-76, ATCC CRL-1587): human 
cenrtcal carcinoma cells (HELA. ATCC CCL 2); 
canine kidney cells (MDCK, ATCC CCL 34); buffaJo 
rat Hver cells (BRL 3A, ATCC CRL 1442); human 
lung cells {W13a, ATCC CCL 75): human liver cells 
(Hep G2, HB 8065); mouse mammary tumor (MMT 
060652, ATCC CCL 51); rat hepatoma cells (HTC. 
Ml. 54. Baumann. M., jl J- Cell BioL 1-8 - 
(1980]) and TRI ceils (Mather, . J.P. ej al., Annais 
N.Y. Acad, Sci. 383 : 44-88 [1982]). 



The transcriptionai and translation control se- 
quences in vertebrate ceil expression vectors pref- 
erably are provided from viral sources. For exam- 
ple, commonly used promoters are derived from 

5 polyoma. Adenovirus 2, and most preferably Sim- 
ian Virus 40 (SV40). The early and late promoters 
of SV40 are particutarfy useful because both are 
obtained easily from tfw virus as a fragrrwit which 
also contains the SV40 viral origin of replication - 

70 (Frers si-. 1S7B. "Nature", 273 : 113). Smaller or 
larger SV40 fragments may also be used, provided 
the approximately 250 bp sequence extending from 
tt>e Hind til site toward the Bq\ I site located in the 
viral origin of the replication is included. Further, it 

75 is aiso possible to utilize the genomic promoters, 
control and/or signal sequences normally associ- 
ated with the a or ^-chains, provided each control 
sequences are compatible with and recognized by 
the host cell. 

20 An origin of replication may be provided either 

by construction of the vector to include an exoge- 
nous origin, such as may be obtained from SV40 
or other viral (e.g. Polyoma. Adenovirus. VSV. or 
BFV) source, or may be provided by the host call 
25 chromosomal replication mechanism. If the vector 
is integrated into the host celt chromosome, the 
latter is often sufficient. 

Rather than using vectors which contain viral 
origins of repfication, mammaiian cells are cotrans* 
30 formed with DNA efKWfing a selectable marker 
and DNA er»coding the a and/or chains. An 
example of a suitable selectable marker is 
dihydrofolate reductase (DHFB) or Unyrnldtne 
kinase. Such markers are proteins, generally an- 
as zymes that enable the identification of trarisformant 
cells, i.e.. celts which had been competent to take 
up exogenous Generally, identification is by 

survival of transformants in cufture mecfium that is 
toxic to untrarwformed cells or from which the cells 
40 canrKt obtain a critical nutrient without having tak- 
en up the marker protein. 

In selecting a preferred host mammalian call 
for transfection by vectors which comprise DNA 
sequences encoding both inhibin arxi DHFR, ft is 
-<« appropriate to select the host according to the type 
of DHFR protein employed, (f wild type OHFR 
protein is employed* it is preferable to select a host 
cell which is deficient in DHFR thus permittirrg the 
use of the OHFR coding sequence as marker for 
50 successful transfection in selective medium which 
lacks hypoxanthine, glycine. afKj thymidine (hgT). 
An appropriate host cell in this case is tiie Chinese 
hamster ovary (CHO) call line deficient tn DHFR 
activity, prepared and propagated as described by 
55 Urtaub and Chasin, 1980, "Proc., Nat'I. Acad. Sd." 
(USA) 77: 4216. 
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On the other hand, if DNA encoding DHFR 
protein with low binding afftnity for methotrexate - 
(MTX) is used as the controlling sequence, ft is not 
necessary to use DHFR resistant cells. Because 
the mutant DHFR is resistant to MTX, MTX con- s 
taining media can be used as a means of selection 
provided that the host cells are themseh^ MTX 
sensitive. Most eukaryotic calls which are capable 
of absorbing MTX appear to be methotrexate sen- 
sitive. One such useful cell line is a CHO line, io 
CHO-KI {ATCC No. CCL 61). Preferably, transfor- 
mants are first selected for neomycin resistance - 
(the transfection is conducted together with DNA 
encoding the neomycin resistance gene), followed 
by MTX amplification of the a and/or $ chain ts 
expression as the case may be. See Kim Sl •» 
"Ceir 4g: 129-138 (1986) and EP 16Q.457A. 

Other methods suitable for adaptation to the 
synthesis of a and/or 0 chains in recombinant 
vertebrate ceil culture are described in M-J. Ge- 20 
thing gt al.. "Nature" 29^: 620-625 (1981): H. Man- 
tei St al-. "Nature" 281: 40-46; and A. Levinson ej 
al.. EP 117.060A and 117.058A. 

The inhtbin a chain is expressed in recom- 
binant cell culture with or without either of the 0- 2S 
chain molecules. Similarly, host cells are trans- 
formed with DNA encoding either or both of the 
mature ;3-chains. Based on analogy to TGF-^. the 
mature ^-chains are capable of forming homo- 
cfimers or ^A/^^sHeterodlmers upon expression in 30 
recombinant culture. These structures are not in- 
htbin and will be referred to herein- as ;5-chain 
dimers or activin. These are useful in the prepara- 
tion of active inhibin, serving as sources of ttte 0- 
chain, or are used as gel electrophoresis standards 3S 
to detect the diversion into 4-chain dimers of 4- 
chains synthesized in a and chain cotransfor- 
mants. As will be seen In Example 4, this is not a 
hypothetical problem. Of course, the dimers are 
also useful in modulating reproduction as noted 40 
above. 

^-chain hetero or homodimers are separated 
by in vitro unfolding of the individual chains fol- 
lowed by oxidative disulfide fcx3nd formation with 
the a-chain in accord wfth processes generally 45 
known cgr s^. Preferably, however, in preparing 
mature inhibin the recombinant host is transformed 
with DNA encoding both the tx and either of the fi- 
chains. The intact hormonally active molecule is 
then assembled by the host cell jn vivo, and it Is so 
thus unnecessary to combing the two chains by in 
vitro processing. The DNA encoding the a and 0' 
chains is preferably located on the same vector, 
and under ttie control of the same promoter, but 
this is not essential. 55 



Certain iS-chain amino acid sequence variants 
identified tn the screening procedure will not bind 
to pituitary call surface receptors nor as a con- 
sequence will they exhibit hom^onaf activity. Such 
variants, when expressed as homodimers in recom- 
binant cell culture, are useful in immunoassays for 
activin when they bear immurwlogical epitopes 
cross-reactivo with the native >3-chain. In addition, 
such variants are coexpressed with DNA encoding 
hormonally active ^-chain to yield a hybrid bearing 
native and variant ^-chain. In this case the variarrt 
serves to stabinze the structure of the native 0- 
chain. This form of ^-chain heterodimer is useful, 
like the homodimer, in immunoassays for activin. It 
may also function as an activin antagonist. 

The activin/inhibin js-chains also are coexpres- 
sed wth TGF-j5 in order to produce ^-chain/TGF-^ 
hybrids. Vectors and methods for the expression of 
TGF-^ are known. For example, see DeryrK;ic st 
Human Transforming Growth Factor-)? Complemef>- 
tary DNA Sequence and Expression in Normal and 
Transformed Cells "Nature" 3i6: 701-705 (1985). 
Contransformation of mammalian host cells by vec- 
tors bearing the TGF-/3 gene as described by De- 
ryr^ fii al. together with the 0a or 0a chains of 
activin/inhibin will result in secretion of a proportion 
of a /3-chain/TGF-;3 hybrid dimers. This hybrid is 
useful in preparing TGF-/ff/^-chain immunogens or 
in immunoassays. 

Inhibin, activin or prodomain sequences are 
recovered from transformed cells in accord with 
Bgr §e knowrr procedures. When a polypeptide- is 
expressed in recombinant bacteria as a refractite 
body, the desired polypeptide is recovered ar>d 
refolded by conventional methods. Alternatively, 
the culture supematants from transformed cells that 
secrete activin or inhibin, preferably mammalian 
cells, are simply separated from the cells by cern 
trifugation. Then the inhibin generally is purified by 
successive purification procedures that include 
heparin-Sepharosa affinity diromatography. gel fil- 
tration and at least one and preferably several RP- 
HPLC (reverse phase high pressure liquid 
chromatography) steps using different conditions in 
the stationary phase and/or mobile phase. Pro- 
domain sequences produced by in vitro synthesis 
will be purified by conventional methods. 

The prodomain polypeptides that are preferred 
for use herein are recovered from the culture me- 
dia of recombinant ceils transformed to syrrthesize 
the a and/or 0 chains as appropriate for the desired 
prodomain. Specifically,^ they are recovered by 
separating Xhe culture medium polypeptides on na- 
tive electrophoresis gel, excising bands having the 
predicted molecular weight and thereafter purifying 
the eiuted polypeptides further, for example by 
FPLC or HPLC, followed by amino acid sequence 
determination for the substantially homogeneous 
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separated polypeptides. Purified prodomain poly- 
peptides then are used to raise antibodies, e.g,, in 
rabbits, which wh n used in immunoaffinity pu- 
rification wiil simplify the recovery of the pro- 
domains. 5 

In the preferred procedure for Isolating porcine 
hormonally active inhtbin, ciarffied transfonnant cul- 
ture supernatant or celt tysate is first purified by 
heparin-Sepharose affinity chrorr^atography, next 
by gel filtration on Sephacryl S-200 gel and then to 
with four successive RP-HPLCs using different mo- 
bite phase gradients and/or derivatizsd silica sup- 
ports. Preferably, stationary phases having rela- 
tively low hydrophobicity are used, with C3-CS 
columns being preferred and C3-C5 and phenyl 75 
columns being particularly preferred. Solute sped- 
ficity of the mobiie phase is preferably adjusted by 
varying the concentration of an organic component, 
parttcuiarfy acatonitrile. Atthough a single RP-HPLC 
fractionation significantly increases the purfty refa- 2a 
tive to the gei-fittrated material, two or more, and 
preferably four. RP-HPLC purifications are gener- 
alty performed subsequent to successive treatment 
by heparin-Sepharose chromatography and gei fil- 
tration. This method has been found to be adapt- 2s 
able to the purification of human inhibin from re- 
combinant cell culture as well. 

The first step of the purification is heparin- 
Sepharose affinity chromatography, in which the 
protein is adsort^ed to the Sepharose-bound 30 
heparin moieties under application conditions, and 
tf)e adsort:)ed inhibin material is recovered by 1M 
NaCi etution. This step greatly expedites the pu- 
rification procedure for crude extracts because it 
allows a relatrvety large volume of a crude extract os 
to be processed fairiy rapidly white recovering an 
amount of protein exhibiting total inhibin activity 
equal to at least 90% of that of the crude extract 

For the detection of inhibin activity in the var- 
ious column fractions, aliquots ranging from 0,01% 40 
to 0.1% by volume are removed, and after added 
100 tig human serum albumin in lOOul water, the 
solvents were evaporated in a Speed-Vac con- 
centrator (Savant Hidcsville, N.Y.). The residue as 
redissolved in 3 ml 1% fetal bovine serum in 45 
HOMEM, fittered through a Mtllex-GS 0.22 um Htter 
(Mtlltpore Corp., Bedford, MA) and assayed in du- 
pBcate. To speed up the bioassays during the 
purification process, only basai inhibition of FSH 
secretion exerted by tfie inhibin activity is deter- so 
nursed and plotted in the region where the inhit)in 
proteins were expected to migrate in the chromato- 
grams. 

To perform the heparin-Sepharose affinity 
chromatography, cell debris is spun down in a 55 
Beckman J2-21 centrifuge (Beckman Irrstruments, 
Inc.. Palo Alto, CA.) using a JA-20 rotor at 10.000 
rpm for 30 minutes. One half of the supernatant is 



diluted in 10 times its voium by the addition of 
0.01 M Tris-HCl containing 0.1 M f^Cl, pH 7. in an 
Erienm yer flask and pumped simultaneously via 
silastic tut>es {0,76 mm ID) into heparir>-Sepharose 
(Pharmacia Rne Chemicals. Piscataway, N.J.) col- 
umns (3.5 X 9 cm) by two Rabbit 4-channel peri- 
staltic pumps (Rainin Instrument Co., )nc,, Em- 
eryville. CA) at 40 ml/hr per column. After ail the 
fluid has been pumped through the heparin- 
Sepharose, the eight columns are washed simulta- 
neously wtttt Q.OI M Tris-HCI, pH 7. containing 0.1 
M NaCl in the same manner. The' adsort^ed pro- 
teins, with inhibin activity are removed by washing 
the eight columns simultaneoosly with 0,01 M Tris- 
HCl containing 1 M NaCI. pH 7, as above, and the 
wash is collected into fractions. The inhibin activity 
is monitored by the ja vitro bioassay described 
above. The columns are regenerated by further 
washing with 2M NaCI in 0.01 M Tris-HCl, pH 7, 
arid re-^uillbrated with O.Oi M Tris-HCI containing 
0.1 M NaCI for purification of remaining extract 

Next the material is fractionated by gel filtra- 
tion to separate proteins gerierally according to 
their mo^eciitar weights. The fractiorts having in- 
hibin activity extracted by the heparin-Sepharose 
columns were pooted and dialyzed overnight to 
remove NaCI in a 28.8 mm cylinder diameter 
Spectrapor No. 3 membrane tubing with cutoff 
at Z,5Q0 (Spectrum Medical Industries, Inc.. tos 
Angeles. CA.) against 30% acetic add. The re- 
tained fluid is centrifuged, as above, to remove a 
white precipitate, and the supematant is divided 
into equal portions for applying to 5 x 100 cm 
Sephacryl S-200 superfine columns (Pharmacia 
Fine Chemicate, Piscataway, NJ.). Each column Is 
etuted with 30% acetic acid at 20 ml for 22 min., 
and the column fractxsns are monitored by UV 
absorption at 280 nm and by bioassay. 

The bioassay-positive protein from ttw S-200 
columns is pooled and tyt)phiTiZBd. The lyophirized 
material is dissolved in 0.2N acetic acid (1 ml/ml) 
and filtered through a MtUex-HA 0.45 urn filter - 
(Milfipore Corp.. Bedford. MA.). The filtrate is ap- 
plied directly onto a 1 x 25 cm Vydac 5-um 
particie-size C4 colunrm (The Separations Group 
Hesperia, CA.) and dovetoped with a gradient of 
TEAP buffer. In the TEAP system, txjffer A cof>- 
sists of 0.2s N triethyiammonium phosphate pH 3. 
and buffer B is 80% acatonitrile in buffer A. ^ter 
all tfie filtrate had been loaded, the column is 
washed with the aqueous txiffer A until the UV 
absorption reached t^asetine. The fractions exhibit- 
ing inhibin activity are separated in a Beckman 332 
gradient liquid chromatography system (Beckman 
Instruments. lnc„ Berkeley, CA.) equipped with a 
Spectroftow 757 UV detector (Kratos Analytical Irv 
stiuments, Ramsey, N.J.), a Soltec 220 recorder - 
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(Soltec Corp,. Sun Valley, CA.) and a Redirac 2112 
fraction collector (LKB Instruments, Inc.. Gathers- 
burg, MD), Zones of inhibin activity are detected by 
bioassay. 

Inhibin protein containing the chain is fur- 
ther purified free of inhibin containing the 0m spe- 
cies, if desired, by two more RP-HPLC steps. The 
first step uses a 1 x 25 cm Vydac 5-ixm-partJCie- 
sizB C4 column and a trffluoroacettc add (TFA) 
buffer system and the second step employs a 1 x 
25 cm Vydac 5-am — particle-size Phenyl column 
and the TEAP buffer system. In the TFA system, 
buffer A contains 1 ml trifluoroacetic acid in 999 ml 
water and buffer B is 1 ml trifluoroacetic acid in 
199 ml water and 800 ml acetonitrile. The two 
inhibin spedes elute separately. Inhibin accumu- 
lated from a few batches was concentrated by RP- 
HPLC using a 0,46 x 25 cm Aquapore RF-300 10 
um-particle-size column (Browntee Labs., Santa 
Clara, CA.) and the TFA buffer system. Ordinarily, 
however, this purification step will not be used with 
cell culture supematants from transformants with 
DNA encoding only the 0a or 4b chains. 

Inhibin, acthrin, prodomain sequences or their 
variants are administered In the form of pharma- 
ceuticatty acceptable nontoxic salts, such as add 
additian salts or metal complexes e.g., with zinc 
iron or the Hke (which are considered as salts for 
purposes of this appGcation). Illustrative of such 
add addition salts are hydrochloride, 
hydrobromide. sulphate, phosphate. maJeate, ace- 
tate, citrate, benzoate, succinate; malate, ascor- 
bate, tartrate and the Hke. Intravenous administra- 
tion in isotonic saline, phosphate buffer solutions or 
the like is suitable. 

The polypeptide herein should be administered 
under the guidance of a physician, and pharmaceu- 
tical compositions will usually contain an effective 
amount of the peptide in conjunction with a con- 
ventionai. phanmaceutically-acceptat^le carrier. The 
dosage will vary depending upon the specific pur- 
pose for which the protein is t>eing administered, 
and dosage levels in the range of about 0.1 to 
about 1 miiligram per Kg. of body weight may be 
used when inhibin is administered on a regular 
basis as a mala contraceptive. 

Inhibin, activin, prodomain sequences or their 
variants desirably are administered from an implan- 
table or skin-adhesive sustained-release article. Ex- 
amples of suitable systems include copolymers of 
L-glutamic acid and gamma athyl-L-glutamate (U. 
Sidman fit ai^ 1983. "Blopolymers" 22(1): 547- 
556). poly (2-hydroxyerttiyl-methacrytate) (R Langer 
et al., 1981. "J. Biomed. Mater. Res." if: 167-277 
and R. Langer. 1982. "Chem. Tech." 12: 98-105) 
ethylene vinyl acetate (P. Langer et al., W.). or 



poly-D-(->-3-Hydroxybutyric acid (HP 133,988A). 
Such articles are implanted subcutaneously or are 
placed into contact with the skin or mucous mem- 
branes. 

5 In order to simplify the Examples certain fre- 

quently occurring methods wiU be referenced by 
shorthand phrases. 

Plasmids are designated by a low case p pre- 
ceded and/or followed by capita! letters and/or 
TO numbers. The starting plasmids herein are com- 
merdaJly available, are pubficly available on an 
unrestricted basis, or can be constructed from pub- 
lidy available plasmids or ONA in accord with 
published procedures, in addition, other equivalent 
75 plasmids are known in the art and will be apparent 
to the ordinary artisan. 

"Digestion" of DNA refers to catalytic cleavage 
of the DMA with an enzyme that acts only at certain 
locations in the ONA. Such enzymes are called 
20 restriction enzymes, and the sites for which each is 
spedfic is called a restriction site. "Partiar diges- 
tion refers to incomplete digestion by a restriction 
enzyme, i.e. conditions are chosen that result in 
cleavage of some but not all of the sites for a given 
25 restriction endonuclease in a DNA substrate. The 
various restriction enzymes used herein are com- 
mercially available and their reaction conditions, 
cofactors and other requirements as established fcjy 
the enzyme suppliers were used. Restriction en- 
30 zymes commonly are designed by abbreviations 
composed of a capital letter foltowed by other 
letters and then, generally, a numt)er representing 
the microorganism from which each restriction en- 
zyme originally was obtained. In generally, about 
35 lag of plasmid or DMA fragment is used with 
about 1 unit of enzyme In about 20 ul of buffer 
solution. Appropriate buffers and substrate amounts 
for particular restriction enzymes are specifiod by 
tt)e manufacturer. Incubation times of atxxit 1 hour 
40 at 37' C are ordinarily used, but may vary in accor- 
dance with the supplier's instructions. After incuba- 
tion, protein is removed by extraction with phenol 
and chk>roform, and the digested nudeic acid is 
recovered from the aqueous fraction by predpita- 
45 tion with ethanol. Digestion with a restriction en- 
zyme infrequently is followed with bacterizil alkaline 
phosphatase hydrolysis of the terminal S' phos- 
phates to prevent the two restriction cleaved ends 
of a DNA fragment from "circularizing'* or forming 
so a closed loop that would impede insertion of an- 
other DNA fragment at the restriction site. Unless 
otherwise stated, digestion of plasmids is not fol- 
lowed by 5* terminal dephosphorylation. Proce- 
dures and reagents for dephosphorylation are con- 
55 ventional (T. Maniatis et ^ 1 982, Molecular Clon- 
ino pp, 133-134). 
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"Recovery" or "isolation " of a given fragment 
of DNA from a restrictJon digest means separation 
of the digest an poiyacrylamide gel elec- 
trophoresis, identification of the fragment of interest 
by comparison of its mobility versus that of market 
DNA fragments of known molecular weight, re- 
nnoval of the gel section containing the ciesired 
fragment, and separation of the gel from DNA, This 
procedure is known generally. For example, see R. 
Lawn gt si,. 1981, "Nucleic Acids Res." g : 6103- 
6114, and 0. Goeddel 1980, "Nucfeic Adds 

Res.- fi: 4057, 

"Southern Analysis" is a method by which the 
presence of DNA sequences in a digest or DNA- 
containing composition is confirmed by hybridiza- 
tion to a known, labelled oligonucleotide or ONA 
fragment For the purposes herein, unless other- 
wise provided. Southern analysis shall mean sepa- 
ration of digests on 1 percent agarose, denaturation 
and transfer to nitrocellulose by the method of E. 
Southern, 1975. "J. Mol. Biol." gg: 503-517, afKl 
hybridization as described by T. Maniatts st SL-* 
1978, "Ceir 15: 687-701. 

Transfomiation" means introducing DNA into 
an organism so that the ONA is repticable, either as 
an extrachromosomai element or chromosomal in- 
tegrant. Unless otherwise provided, the method 
used herein for transformation of ^ coK is the 
CaCI, method of Mandei al- 1970, "J. Mol. Biol.' 
§2:154. 

'Ligation refers to the process of forming 
phosphodiestor bonds between two double strand- 
ed nucleic add fragments (T. Maniatis ai al- *d-, p. 
146). Uniess otherwise provided, ligation may be 
accomplished using known buffers and conditions 
with 10 units of T4 DNA (igase Cligase") per 0.5 
UQ of approximately equimolar amounts of the 
DHA fragments to be Ugated. 

•preparation" of DfsiA from transformants 
means isolating ptasmid DNA from microbial cul- 
ture. Unless otherwise provided, the alkaline/SOS 
method of Maniatis gt Id. p, 90., may be used. 

"Oligonucleotides" are short length single or 
double stranded polydeoxynucleotides which are 
chemically synthesized by known methods and 
then purified on pciyacrylamide gels. 

All citations are expressly incorporated by ref- 
erence. 



EXAMPLE 1 

Isotaition of Ctoned inhtbin o-Subunit cDNAs 

The strategy for identification of clones contain- 
ing coding sequences for the porcine inhibin sub- 
units was based on the "long-probe" approach, 
successful in some previous instances (Anderson 



et a[., 1983. "Proc. Nat Acad. Sci. USA" §0:683^- 
8842 and Ullrich et a(.. 1984. "Nature" 309:418- 
425). Briefly, a high-complexity cDNA library con- 
structed in xgtIO and derived from porcine ovarian 
5 mRNA by oligo-dT-primed cDNA synthesis was 
screened with a single 64-base-long synthetic 
oiigodeoxynucleotide directed against the N-termi- 
nal amino add sequence of the cr-chain of porcine 
inhibin. It was found that the library is to be pre- 
ro pared from fresh ovarian tissue because the inhibin 
chain mRNA was apparently quite labite. Approx>- 
mateiy 1 in 2.000 plaques hybridized with this 
probe, and sequence analysis of several hybridiz- 
ing cloned cONAs confirmed correct probe iden- 
75 ttfication. This analysis revealed that none of the 
characterized cDNAs contained suffident sequence 
information to predkzt the complete structure of the 
a-chain precursor protein. Rather than analyzing 
more ck>nes from the same cDNA library, a second 
20 library was constructed by 3' extension on ovarian 
mRfiA of a synthetic oligodeoxynucJootide com- 
plementary to a sequenced region encoding a pre- 
cursor residues 60-64 (Rg. 1A). This library was 
screened with a suitable restriction fragment from a 
25 previously analyzed cONA and yielded several iso- 
lates which specified the remainder of the DNA 
sequences encoding the fvf-tenminaf region of the a 
precursor. Completeness of the coding sequence 
was judged from the presence of a long reading 
30 frame vrhtch specifies the pordne o-chain peptide 
sequence ar^j starts wfth a methionine codon pre- 
ceded by an in-frame stop codon and followed by 
a hydrophobic sequence tearing the hallnr^arks of a 
signal peptide. The full sequences for the precursor 
35 protein and its cDNA are shown in Fig. IB. The 
complete protein induding signal peptide has an 
Mr of "40K consisting of 364 amino adds, of which 
ttie C-terminat 134 (Mr "14,5K) constitute the por- 
cine inhibin a-chain. There are several Arg-Arg 
40 sequences in the pnxegion of the precirsor, one of 
them directly preceding the a subunit. We believe 
that this latter pair of basic residues is the process- 
ing site for the proteolytic release of tt)e « peptide. 
The deduced precursor sequence predicts two N- 
45 linked glycosylstion sites, one within the a chain 
proper. 

In addition to the coding region, the cONA 
sequence contains a 3'-untrsn3lated sequence of 
167 nucteotides, including tt)6 carxxiicaf AATAAA 
50 polyadenyiation signal, and a S'-untranslatsd re- 
gion, the proper length of which is presently un- 
known. 

The detailed method was as follows: 
Polyadenytated mRNA was prepared from 
55 freshly frozen porcine ovaries (Kaplan "J- 
Biochem." 1]^: 181-184) An oligo-dT-primed cDNA 
library of "8X10* ctones in XgtiO (Huynh gl al-» 
1984. DNA Clonino Techniques .. Ed. 0. Clover) 
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was prepared form 5 ug poiyA+ mRNA as de- 
scribed by Wood 2t at., "Nature" 31£: 330-337 - 
(-1984), except that the Eco RI adaptors used had 
the sequence 5'-AATTCACTCGAGACGC-3' 
3'-GTGAGCTCTGCG-5'P. 

Approximately 1 x 1 0' unampiified cD^4A 
clones were screened with 5 o-subunit 
oligonucleotide 5'-ACCGCCCCI i iGCCTTGGCCT- 
TGGTCCCCTGCTGCTCTGAGACTGCTGCAGAGA- 
CCTCCTGAGG-3', based on the amino acid se- 
quence underiined in Fig. IB. Hybridization was 
carried- out witti the phosphorylated **P-iabelled 
probe in SxSSC, 40% formamide at 37 'C. Ftit&rs 
were washed at 50 •C wrth IxSSC, 0.1% SDS. 
Approxim'ately 500 hybridization positive clones 
were obtained, twelve of which were purified and 
examined for insert size. The BcoR l inserts of five 
of these PiP!N-a2, -«x5A, -aS, -ei9, -alO) were sub- 
cloned into Ml 3 derivatives (Messing et aJ., 1981 
"Nucl. Adds Res.* 9:309-321) and sequenced by 
the dideoxy chain termination method of Sanger 
ai./Proc. Nat Acad. Sci. USA" 74:5463-5467 - 
(1977). A spedfically primed library was prepared 
by priming 5 ug of potyA* m RN A witii the 
oligonucleotide 5'-CCCCACAGCATGTCTT-3' - 
(complementary to nucleotides 248-263) and suti- 
sequent cloning into XgtlO. Approximately 2x10* 
clones of the 1x10* clones obtained were screened 
with the 5* lOObp Eco RI- Sam Ht fragment prepared 
from XPIN-a2. Twelve of the 170 hybridization posi- 
tive clones obtained were purified and two (XPIN- 
Sl2s, -S4s) were sequenced by the dideoxy meth- 
od. The complete nucleolidd sequences of the a* 
subunit cDNAs were obtained by subdoning var- 
ious restriction fragments with the different X iso- 
lates into the Ml 3 phage derivatives. Compres- 
sions were resolved by the use of deoxytr^osine 
mixes in combination with the E, coii single strand- 
ed tunding protein (Pharmacia). 



Isolation of Cloned inhibin Subunit cONAS 

The cDNA sequences encoding the precursors 
of the inhibin subunits were obtained from the 
same cONA libraries used for the a subunit Over- 
lapping cONA clones were isolated by screening 
first with single long synthetic otigodeoxynucleotide 
prot>es based on the two N-terminal ^ suixjnit 
sequences and subsequently with suitable restric- 
tion fragments derived from characterized cDNA 
clones which served as probes for -walking" in 
tjoth 5' and 3' directions (Rg. 2A). 

In more detail, approximately 2x10* oligo-dT 
primed ovarian cDNA clones were screened with 
the 5* end lat>elled ^AOligonucleotide, 5*-AAGAAG- 
CAGTTCTTTGTGTCCTTCAAGGACATTGGCTGG- 
AATGACTGCSATCATTGC-3* based on the amino 



acid sequence of residues 321-339. Rve hybridiza- 
tion positives were obtained, of which three proved 
to contain fi^ coding sequences (XPIN-4a2. -^a-*, - 
^aB). a 5, end 154 bp EcoRI-Hindlll (nucleotides 

s 158-297) fragment and a 3' end 213 bp Eco Rt-Pst 
fragment (nucleotides 1 679-1892) derived from 
XPIN^A2 were used to screen 2x10* oligo-dT prim- 
ed cONA clones and 2x10* clones from the cr-chain 
specifically primed Bbrary. Out of the sixteen 

TO clones analyzed in detail two were fourxi to have 
longer 5* ends (XPIN-jSaSs, -^a22) and one clone 
XPIN-i5A2l contairted the entire 3*-untranslated re- 
gion. Porcine inhibin /Sb subunit cDNA clones were 
isolated by screening 2x10^ clones from the spe- 

75 cificaliy primed library with the oligonucleotide 
5*- 

GGCCTGGAGTGTGATGQGAGAACCAACCTGTO 
CTGCCGCCAGGAATTTTTCATCGATTTCAGGCT- 
3', which was based on the NHj-terminai sequence 
20 described in Fig. 1A. Posrttve clones were further 
screened with the oligonucleotide inosine protje 5'- 
AAiTCTATlAAIAA"E TG -3* (T in this sequence 
stands for inosine). which covers all the possibili- 
ties in the non-coding strand for the amino acid 
2S sequence QQFRDF. Two donas (XPIN^Sb-Is, -2s) 
were isolated and sequenced and found to code for 
the subunit A 230 bp Eco Rl -Sma (nucleotides 
21-251) fragment was isolated from XPIN^H and 
used as a hybridization probe to screen 2x10^ 
30 oiigo-dT primed cDNA clones. Two positives were 
obtained (XPIN^b-3.4). The nucleotide sequefK» of 
these overlapping clones was used to construct ttie 
sequence shown. All sequences were obtained by 
sutx^ontng specific fragments into Ml 3 phage vec- 
35 tors (Messing fij al., 0£ cij.). The Eco RI restriction 
sites referred to above are all contained within the 
cONA adaptor fragment arxl do not refer to se- 
quences present in the cDNA. 

We noted that only very few clones from the 
40 oligo-dT-primed library (4 out of 2x10') hyt»idized 
with the synthetic probe for the ^-subunit of inhibin 
A- Although most of these proved correct by OUA 
sequence analysis, none contained a full 3'-untran- 
slaled region, as judged by the absence of a polyA 
45 homopolynier at their 3* ends. Absence of polyA 
tails suggested the existence of a very long 3*- 
untransiatod sequence in this mRNA species 
and/or structural region(s) which prove difficult to 
copy by the polymerases used for library construe- 
so tion. Unexpectedly r a higher abundance (-lO-foW) 
of inhit>in j3a subunit coding sequences was found 
in the cDNA library made by specific primir^ on ar 
subunit mRNA. This library was screened with the 
synthetic probe for the ^-chain of inhibin A on the 
55 subsequently refuted theory that the a precursor 
mRNA might also encode the 0 subunit The high 
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abundance of inhibin 4a cDNA in this library was 
!at©r trac&d to fortuitous compiementarfty of the 
specific a chain primer to a region in the 3*- 
untranslated portion of the corresporxfing mRNA. 

Only four cloned cONAs encoding the ^ sub- 
unit of inhibin B were found in our libraries. The 
sequence information otrtained from these clones 
failed to reveal the complete structure of the cor- 
responding precursor protein and its cDNA. The 
sequences of cDNAs and deduced protein struc- 
tures for the precursors of the 0 subunits are 
compared in Rg. 2B. The nucleotide sequence of 
inhibin fi^ subunit cONA is 3.6 kb in length and 
contains an open reading frame for a protein of 425 
amino acids (Mr "46K). the C-terminal 116 residues 
of which represent the 0 subunit proper (Mr ~13K). 
This reading frame begins with a methionine codon 
followed by a sequence that codes for a char- 
acteristic signal peptide, the true length of which is 
t)eiteved to be 29 residues. The encoded 0 sut^unit 
is preceded by a string of 5 arginines at which it is 
presumably proteotyticaiJy cleaved from the precur- 
sor. Similar to the a subunit precursor, .this 0 
precursor contains several additional pairs of basic 
residues at which hitherto unknown t>iologic3ily ac- 
tive peptide entities are believed to be released, tt 
also contains one possible site of ^Minked 
glycosylation in the proregion (Asn. residue 165). 

The deduced protein sequer)ce for the 0 sub- 
unit of inhibin B shows high homology with the 0a 
sutxjnit sequence. 71 amir>o acid residues are 
identical and most changes are conservative in 
nature. Sequence homology, although of a lesser 
degree, is also found in the proregion of both 0 
subunit precursors. Interestingly, an extremely 
purine-hch sequence rarely seen in coding regions 
but present in the cONA encoding the ir^ibin 0^ 
precursor and resulting in a curious amino acid 
sequence is not found in the cONA which codes for 
the homologous 0b precursor. This results in a gap 
of 22 amino acid residues from the precursor of 
Inhibin when protein sequer>ces are aligned for 
maximal homology. Such alignment also brings 
about a perfect match in the cysteine positions of 
both precursors (see Fig. 2B). 



trophoresed into a formaldehyde 1.2% agarose gel 
and blotted onto nitro-c llulose filters. The following 
^P-labelled cDNA fragments were used as hybrid- 
ization probfts under stringent conditions. Lane a; 
5 240 bp EcoRI- Sma t (nucleotides 134-371) from a 
subunit cdNA; b: 154 pb Eco RI -Hind lll (nucleotides 
158-297) from j8A subunit cONA; c: 230 bp Ecq Ri- 
Sma (nucleotides 21-251) from 4b subunit cDNA; d 
and e: ScoRt insert of XPIN-o2; f and g: EcoRI 
70 insert of XPIN-^aS. Riters were washed for 2 hours 
with 3 c^wnges of O.lxSSC, 0.1% SDS at 60«C. 

Analysis showed (Frg. 3) that cr and 0 mRNAs 
are of differern size and abundance, as indicated 
by results obtained from cDNA cloning. From their 
75 respective barxJ Intensities the a precursor mRNA 
is estimated to be at least of 10-fold higher abur>- 
dance than the mRNA for the 0^ precursor, and 
approximately 20-fokj higher than the mRNA for 
the 0B precursor. 
20 Using ribosomal RNAs as size standards, the a 

precursor mRNA which is a single species, is 1500 
nucleotides in length, a size in good agreement 
with tiie cloned cDNA sequence (Fig. IB). 0x pre- 
cursor mRNA S6quer>ces are represented by two 
25 main species of "4.5 and •7.2 kb in length. The 
reiativeiy higher intensity of both species in 
polyadenylated than total RNA suggests that the 
4.5 kb species does not represent 283 RNA which 
hybridized to the cONA probe. Thus, the 0 prscur- 
30 sor cONA sequences shown in Fig. 26 are thought 
to represent the 4.5 kb mRNA. suggesting that tiie 
5' untranslated region for the 0a mRNA is approxi- 
matety 900 nucleotides long. The 0b precursor is 
encoded on one mRNA. of approximately 4.5 kb in 
3S size, which is present at roughly half the level of 
the two 0A mRNAs. Since the two 0 mRNAs are 
ctosely related, one can predict that both mRNAs 
have a simitar structure and tiius the 0^ mRNA 
presumably possesses a long 5* and 3* untran- 
40 stated region equivalent to that shown for the 0a 
mRNA. Choice of a different potyadenytation signal 
might explain the existence of the 7.2 kb speo'as. 



45 Homology To Transforming Growth Factor-j5 



Northern Analysis of a and 0 chain Precursor 
mRNAS 

Ovarian total and polyadenylated RNAs were 
analyzed by the Northem procedure using the 
sequenced cDNAs as probes to assess size and 
relative abundance of the mRNAs which encode 
the peptide subunits a and 0 and of the 
heterodimeric inhibin molecule. Polyadenylated 
mRNA (2iLg: lanes a, b, c. and f; 8 ag: lane d) and 
total RNA (10 itg: lanes e and g) were elec- 



The mature a and 0 inhibin subunits contain 
seven and nine cysteine residues respectively. 
Upon alignment of the cysteine residues it is ap- 

50 parent that the two subunits share a similar 
cysteine distribution and some sequence homok>gy 
exists around these residues (Fig. 4). suggesting 
that both types of subunits derive from orre ances- 
tral gene. Surprisingly, significant homology was 

55 found between the 0 chain and the primary struc- 
ture of human TGF-4 recently determined. As out- 
lined in Fig. 4, both peptides are of nearly equal 
length (inhibin 0a subunit 116, 0s subunit 115; 
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TGFS, 116 residues) and show a strikingly similar 
distribution of their nine cysteine residues. Using 
this cysteine "skeleton" for alignment the 4* and 
TGF-^ sequences have an additional 31 residues 
in identicai positions and show conservative 
changes in nine homologous places. Similar high 
homologies are seen upon comparison of the 
and ^-TGF. Some gaps were introduced for better 
alignment (Rg. 4). The overall homology reaches 
35%. but approaches 60% in certain sections (cf. 
porcine inhibin 4a chain residues 11-45 and TGF 
residues 15-49), a very high degree of homotogy 
considering the difference in species. Interestingly, 
this homology extends beyond the termination 
codon for protein synthesis in the respective 
cDNAs. Thus, the cDNAs for TGF-;ff and both in- 
hibin 0 subunits contain a highly G and C rich 
sequence in this region, and they also possess 
unusually long 5* and 3* untranslated regions. 

One can discount the suggestion that the 0 
subuntt of inhibin is the porcine equivalent of hu- 
man TGF-4, since there is almost absolute homol- 
ogy between human and murine j9-TGFs, These 
findings strongly indicate that both inhibin subunfts 
and TGF-j9 have a common ancestor and betong to 
one gene family. All three peptides are derived 
from ^miiarly-sized precursors (M,. -40K) where 
they occupy the C-terminal 110 or so residues and 
are released by proteolytic cleavage at pairs of 
arginines. They form homo-or heterodimers, and 
subunits in the biologically active complex are 
linked by disulfide bridges- However,- there. is- Itttia • 
sequence homology between TGF-4 and the j3 
subunits in the pro parts of their precursors, al- 
though the regions comprising the odd residues 
which precede the ff suburut and TGF peptides 
display limited but significant sequence related- 
ness. 



EXAMPLE 2 

Recombinant Synthesis of Porcine Inhibin 

The ptasmid used for recombinant synthesis of 
porcine inhibin was pSVE-PorB Ainh-DHFR The 
procedure to construct this plasmid is shown in 
Fig. 5. This plasmid was constructed as follows: 

pHBS348-E (BP 0073856A) was partially di- 
gested with Ecq RI, blunted with £. £oii DNA pxjly- 
merase I (Klenow fragment) and the four dNTPs, 
Ggated and the ligation mixture was transformed 
into E. coH in 294 (ATCC 31446). The transformed 
culture was plated on ampicillin media plates and 
resistant colonies selected. Ptasmids were 
screened for the loss of the Eco RI site preceding 
the SV40 eariy promoter. A plasmid having the site 
deleted is referred to as pHBS348-Ell. 



pHBS34S-Eil was digested with EcoRl and 
Eco RI to produce two fragments, fragment I con- 
taining the SV40 early promoter, pmL-Amp'' se- 
quences and the HBsAg 3' untranslated region and 
5 fragment 2 containing the HBsAg (hepatitis B anti- 
gen) coding sequences. 

XPIN)5a5s containing the coding region for the 
porcine inhibin ^a subunit was digested with EcoRI 
and Smal and the 1335 bp fragment (fragment 3) 
10 containing the fi^ coding region recovered fay 
poiyacrylamide gel electrophoresis. Fragment I. re- 
covered by agarose get electrophoresis, was ligat- 
ed to fragntent 3 and the ligation mixture trans- 
fonned into g. ssS. strain 294 (ATCC 31448). The 
Ts transfonned culture was plated on ampicillin media 
plates and resistant colonies selected. Plasmid 
DNA was prepared from transfe r mants and 
checked by restriction analysts for the presence of 
the con-ect DNA fragments. This plasmid is re- 
20 ferred to as pSVE-p^Alnh. 

pHBS34d-E (EP 0073656A) was partially di- 
gested with Eco RI. blunted with £. call DNA poly- 
merase I (Klenow fragment) and the four dNTPs, 
ligated to the synthetic oligonucleotide 5* 
25 GGTCGACC-3* containing the Sai l recognition site. 
The transformed culture was plated on ampicillin 
media plates and resistant colonies selected. Plas- 
mids were screened for the presence of the extra 
Sai l restriction site. Plasmid DNA is prepared from 
30 this construction (pHBS346-ESaiI). 

XPINa-l2s and XPINa-2 were digested with 
EcoR I and Bam HL A 104 b oEcoR I- Bam HI fragrr^ent 
from XPINa-12s containing tfie 5' coding region 
and a 1246 bp Eco Rt -BamH I fragment from XPtNo- 
35 2 containing the middle and 3' codir>g region were 
recovered and ligated together. The ligation mix- 
ture was digested with Eco RI. Ihe enzyme heat 
denatured, and the mixture figated to Eco Rt-dt- 
gested pUC9 (BFIL). Recombinants were selected 
40 and confirmed by restriction analysis. DNA was 
prepared from the correct plasmid (pPINor). 

pPINa. containing the complete coding region 
for pordne a-inhibin was digested with Nco l and 
EcoRI. filled in by Pol(l)K in the presence of 
45 4dNTP*s, tt>e 1280 bp fragment (fragment 4) was 
recovered by gel electrophoresis. pHBS348-£SalI 
was digested with Sstll and Hindlll, filled in by Pol- 
(l)K in the presence of 4dNTP's, and fragment 5 
containing the PML-Amp' region, SV40 earty pro- 
50 meter and HBsAg 3* untranslated region was re- 
covered by gel electrophoresis. Fragments 4 and 5 
were ligated together and the ligation mixture used 
to transfonr^ g. saR 234 (ATCC 31446), Reconrw 
binants were selected by growing on Ampicillin 
55 media piales. The desired recombinant is called 
pSVE-Palnh. 
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pHBS34S-£SaiI was digested with §ail and 
Hindlll and fragment 6 containing the pML-Amp'', 
and SV40 early promoter was recovered by gel 
eiectrophoresis. pFD 11 (EP 117.060A) was digested 
with Sal I and Hindlll and fragment 7 was recov- 
ered which contains the normal mouse DHFR gene 
fused to the HBsAg 3* untranslated region. Frag- 
ments 6 aruj 7 were iigated. and the ligation mix- 
ture transformed into £. £2il strain 294 {ATCC 
31446). The transformed cutture was plated on 
ampicillin madia plates and resistant colonies se- 
lected. Plasmid DNA was prepared from transfor- 
mants and checked by restnction analysis for the 
presence of the correct DNA fragments. This con- 
struction is referred to as pFDII-SalL 

pSVE-Pcrlnh was digested with Sail and frag- 
ment 8 was recovered which contains the SV40 
early prcxnoter and the o-inhibin coding region 
fused to the HBsAg 3'-untranstated region. pFDlt- 
Sall was digested with Sail and fragment 9 contain- 
ing the SV40 early promoter and the mouse DHFR 
coding region linked to the HBsAg 3*'urTtranslated 
region was recovered. pSVE-)5Alnh was ftnearized 
by Sai l digestion and tigated to fragments 8 and 9 
in a three part Hgation. The ligation mixture was 
transformed into£. soil strain 294 (ATCC 31446). 
The transformed culture is plated on ampciliin me- 
dia plates and resistant cotonies selected. Transfor- 
mants were screened for the presence of frag- 
ments 8 and 9 in the correct orientation such that 
transcription from the three SV40 early promoters 
wiH proceed in the same direction. This ffnal plas- 
mid is designated pSVE-Pa^Alnh-DHFR. 

Plasmid pSVE-Pa^xInh-DHFR was transfected 
into DHFR deficient CHO cells (Urtaub and Chasin. 
1980, PNAS 77,4216-4220). However, any DHFR 
mammalian host cell is suitable for use with this 
plasmid. Altemativety, any mammalian host cell is 
useful when the host cell is cotransformed with a 
ptasmtd ortcoding neomycin resistance, and trans- 
fomiants identified by their ability to grow in 
neomydn-containing medium. 

The transfected CHO cells were selected by 
cultuhng in 15 HGT~ medium. The cells were aJ- 
towed to grow to conf}uency in 15cm diameter 
plates. The cells thereafter were cultured in serum 
free medium for 48 hours prior to harvest 50ml of 
supematam medium was lyophilized after the addi- 
tion of 100mg human serum albumin. The residue 
was redissotved in 3mi 1% fetal txjvine serum in 
HDMEM (GIBCO Laboratories, Santa Clara. CA), 
flltdred through a Millex-GS 0.22mM filter (Milfipore 
Corp., Bedford^ MA) and assayed in duplicate. 

The inhibin hormonal activity in the transfor- 
mant supematants was determined by an in vitro 
bioassay using rat anterior pituitary morKjiayer cuK 
ture. Vale, W, §t al, Endocnnoloov . 562-572 - 
(1972). In brief, 21 -day-old female rat anterior 



pituitaries were collected, enzymatically dispersed 
and plated in 10% fetal bovine serum in HDMEM • 
(GIBCO Laboratories. Santa Clara, CA) into 24-well 
tissue culture plates (Falcon Plastic, Oxnard, CA) 
5 on day 1, On day 2, the medium was changed to 
1% fetal bovine serum in HDMEM, and the trans- 
formant medium sample was added. Incubation 
was continued for another 48 hours. The monolayer 
medium was then harvested, and the LH and FSH 
TO contents were determined by radio-immunoassay 
. (RIA) using materials provided by The PitiMtary 
Honnone Program of NIADDKD, in this assay, the 
inhibin-containing CHO cell culture inhibits the teas- 
el release of FSH but not LH, as compared to 
7S control pituitary celis that received the incuisation 
medium only. The amount of porcine inhibin de- 
lected in transformant supematants was 20 ng/ml 
and exhiljited a dose response curve parallel to 
that ot>tained with pure pordne ovarian inhibin. 
20 Immunological cross-reactivity is assayed by a 

sandwich-type radioimmunoassay. RabtMt antisera 
are raised against purified porcine follicular inhibin 
by S.C. immunization of rabbits with the porcine 
inhibin in Freund's complete adjuvant. The pres- 
25 ence of anti-inhibin in the antiserum is detected by 
incut>ation of the antiserum with purified porcine 
inhibin and assaying for the formation of an im- 
mune complex by conventional techniques, e.g. gel 
filtration. An aliquot of the antisera is coated onto 
30 goat-anti-rabbit IgG precoated polystyrene test 
tubes. The recombinant culture supernatant or ex- 
tract (s diluted- into phosphate- buff ered saline and- 
added to the coated tubes, incubated overnight and 
washed. Another aliquot of the rabt>it antiserum is 
35 added to the test tubes, incubated and washed. 
Radioiodtnated goat antirabbit IgG is added to the 
tut>es. incut^ed and unbound goat antiserum re- 
moved by washing. The recombinant produced in- 
hibin cross-reacts with the ratJbit antiserum, as 
40 evidenced by bound counts on the test tubes 
which exceed those of controls incubated with cul- 
ture medium or extracts from untransformed host 
cells. 

45 

EXAMPLE 3 

Construction of Human Inhibin Vector and Expres- 
sion of Human Infilbtn in Recombinant Cell Cufture- 
50 I 

Expression of human inhibin fadtitated t>y 
the discovery that the mature porcir>e and human 
chains are identical. Thus, construction of a 
55 vector for the expression of human inhibin can 
proceed from plasmid pSVE-^A-lnh from Example 
1 , which contains the porcine ^A-encoding cDNA. 
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A Xgt 10 library of human ovanan cDNA made 
from 10 ug of ovarian mRNA was subjected to 
Southern analysis using radiopliosphate labelled 
porcine cDNA encoding a. and 4b chains. 
XHINcr-2 was identified as containing coding re- 
gions for the human a inhibin chain. The prev- 
alence of hybridizing clones in the case of human 
a inhibin was considerably less than that found for 
porcine a inhibin, on the order of 1 in 100.000 
human clones hybridized to tt>© 685 bp Sma i frag- 
ment of the porcine cDNA for alnh. The fi chain 
clones were also rare, with the clones being 
present at about 3 times the level of jSa (1 and 3 
out of about 1,000.000 clones, respectively). None 
of the 4 chain clones were full length. They were 
supplemented wfth a primed cDNA library and as- 
sembled generally as described above for the por- 
cine cDNA, The X inserts were recovered by Eco RI 
digestion. 

Plasmids pHlNo-S is digested with Nco l and 
Smal . and the 1049 bp 15 fragment (fragment 10) 
is recovered by gel eiectrophoresis. pPina 
(Example Z) is digested with Eco Rl and Pvu U. The 
98 bp fragment (fragment 11) is recovered by gel 
electrophoresis. Fragments 10 and 11 are ligated 
to adaptor I 5'- 

CTG(jrCCTCTTGCTGTTG6CCCCACGGAGTGG- 
GCATGGCTGCCAGGGCCCGGAGCTGGACC-3'. in 
combination with adaptor 11 which is the comple- 
ment of adaptor \, The resulting 1208 bp fragment - 
(fragment 12) is treated with Ktenow fragment of 
Poi(l) and the -4 dNTP's ^ ttgated to- pHBS348- 
ESall which has been restricted with HindlH and 
Sac lll and blunt-ended as described in Bcan^ple 1. 
Altemativeiy, pPina was digested with Eco RI and 
Hoa il with the fragment encoding upstream from 
the Hpa tl site (that is, the first 21 residues of the 
pordne sequer)C8) being recovered. Tbe adaptor 
used in this alternative approach was 
5'CGGAGCTCGACC 3' 
3* CTCGAGCTGG ^. 

A piasmid pSVE-Halnh having the con^ct orienta- 
tion of fragment 12 is identified by sequence analy- 
sis of transformants. This construction (pSVH- 
Halnh) thus contains the first 24 residues of the 
porcine signal sequence with the remainder being 
prepro human inhibin. Piasmid pSVE-Halnh is di- 
gested with Sai l. The fragment containing the SV40 
promoter and human inhilstn sequence is ligated to 
fragment 9 and §ali digested pSVE-jffAlnh (Example 
2). This final piasmid deagnated pSVE-hff;?Alnh- 
DHFR1 is transfected into DHFR-deficient CHO 
cells and selected as described in Example 2, The 
culture supernatant contains hormonaity active hu- 
man inhibin. 



EXAMPLE 4 

Construction of Human Inhibin Vector and Expres- 
sion of Human inhibin in Recombinant Cell Cutture- 

5 (I 

This example is similar to Example 3 except 
that Itie pro sequence of human inhibin "vvas 
employed in the place of the porcine fis prepro 
70 domain. 

The lamtxfa gtIO library of Example 3 yielded 
>J4INa2. as descrit>ed in example 3, together with 
XHIN^A-5 and -14. The latter two phage were em- 
ployed to construct the full length /3a coding cDNA 
T5 by ligating the 311 bp Eco R hHind III fragment - 
(fragment 13) of XHin^A-5 to the 1100 bp Hind lll- 
Hpa l fragment (fragment 14) of XHIN^a-14 and 
ligating this mixture in an Eeo Rl- Sma l digested 
mp18 vector (Biolabs). Clones were selected and 
20 screened for the appropriate sized insert An mpl6 
vector containing the correct insert was treated with 
ONA polymerase{l) and the four dNTPs in order to 
render it double stranded, and thereafter digested 
vrith Xba i (which cleaves in the mp18 polylinker 
25 sequBnce)» Ijiunted with DNA polymerase I and the 
four dNTPs, and digested wH^ Eco RL A 1320 bp 
fragment (fragment 15) was ligated to the Eco Rt- 
5S2^ fragment 1 from Example 2. This ligatioo 
mixture was used to transform E. ^fi 294 cells, 
30 Ctones were screened by Southern Hybridization 
and coftfirmed by restriction analysis. The done 

containing the hlnh^^ coding sequence was dee- - 

ignated pSVE-hum^JAlnh. A piasmid containing the 
human coding sequences and the human er- 
as inhibin sequences together with tfie DHFR gene is 
constructed from plasmids pSVE-hum^Alnh, pSVE- 
Holnh and pFOitSall as outlined above. Speoftcally. 
the Sai fragments from pSV&^Halnh and pFDUSalt 
which contain the human alpha inhibin and the 
40 DHFR genes were ligated with Sail digested pSVE- 
hum^Alf^h and a clone containing all three genes 
was Identified. This piasmid, designated pSVE- 
huma^Alnh-DHFR2 was transfected irrto DHFR- 
CHO cells and selected by culture in ghf medium. 
4S 24 clones were picked, grown to confluency in ghr 
medium under conditions converttional for CHO 
cells for two days, allowed to rest for 2 more days 
and thereafter the culture media were assayed for 
inhibin and activin activity using the rat pituitsry 
50 cell assay descrit>ed above. 4 clones were found to 
secrete significant levels of human a^A inhibin 
(hajSA-8, 12. 14i and 18). The levels In the culture 
medium for each clone were, respectively. 125. 
125, 200 and 250 ng/mi. Another clone (ho^A-H) 
55 produced activin as the ;3a^a homodimer. but no 
detectable inhibin, as determined by biological ac- 
tivity and the lack of a chain immunoreactivrty in 
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the culture medium for this clone. Clone 
secreted only a chain and was devoid of activin or 
inhibin activity. 

EXAMPLE 5 

Recombinant Expression of Human Activin . 

As reported by Vale ej. §1. (Id.) and Ung si al- - 
(Id.), homodimers and heterodimers of the ^ chains 
A and/or B have the opposite effect of inhibfn on 
the pituitary, introducing rather than inhibiting F5H 
secretion. Tnese proteins, collectively termed ac- 
tivin. are made in o and 0 chain cotransformants as 
described in Example 4. However, somewhat less 
screening for an appropriate transformant is need- 
ed if the initiat transfection is conducted with a 
vector or vectors that do not contain the a chain 
0ene. A suitable vector is readily constructed from 
the above-described vectors by excising the a 
chain gene. Plasmid pSVE-humjSAinh from Exam- 
ple 4 is digested with ^1 and ligated to fragment 
fl (Example 2) containing the DHFR gene. The 
ligation mixture was used to transfect £ 22£ 294 
ceils and colonies selected on the basis of failure 
to hybridize to the a chain sequence but which did 
hybridize to the 0 chain DNA. A done pSVE- 
hum^A'nh-DHFR was identified from wttich the a 
chain ONA has been deleted. This clone is trans* 
focted into DHFR- CHO cells as described above. 
Transformants are identified that secrete activin 
irrto the culture medium. Similariy, an expression 
vector containing a coding sequence - 

(reconstituted by ligating DNA encoding the first 34 
amino acids of human to the remaining coding 
sequence of the human /3b chain) is readily cor>- 
structed and contransfected wrth pSVE-humi3Alnh- 
DHFR to produce the heterodmer. The recon- 
structed human gene also is used in the for- 
going plasmids in order to produce a^r-in^ibin 
which, in the in vitro bioassay has essentiafty 
equivalent biological patency to the afi^ form of 
inhibin. 

Although the invention has t^een described with 
regard to its prelen-ed embodiments, which con- 
stitute the best mode presently krK>wn to the in- 
ventors, it should t>e understood tt^ various 
changes and modifications as wouid be obvious to 
one having the ordinary sWII in the art may be 
made without departing from the scope of the 
invention which is set forth in claims appended 
hereto. 



Ctainns 

1. A method comprising (a) constructir>g a vec- 
tor which includes nucleic acid encoding inhibin a 

5 chain and/or an inhrbin 0 chain (b) transforming a 
host cell with the vector, and (c) culturing the 
transformed cell. 

2. The method of claim l wherein the nucleic 
acid encodir>g the inhit>tn a chain and/or $ chain is 

70 operably linked to a promoter recognized by the 
host cell and irv::iudfng the further step of recover- 
ing inhtt^n or a >fl chain dimer from the culture 
medium. 

3. The method of claim 1 or 2 wherein the cell 
75 is a prokaryote. 

4. The method of any preceding claim wherein 
the vector comprises nucleic acid encodirig the 
prepro forms of totS\ the \r\f\\t3kt\ a chain and an 
inhibin $ chain. 

20 5. The method of claim 1^ or 4 wherein the 

cell is a cell from a mufticellular organism and 
hormonaliy acthm inhibin is produced. 

6. The method of claim 2 wherein the promoter 
is a viral promoter. 

25 7. The method of claim Q wherein the promoter 

is an SV40 promotor. 

a. The method of claim any preceding claim 
wherein mature porcine or human inhibin is recov- 
ered. 

00 9. The method of any of claims 1 to 3 or 5 to 7 

wherein the vector comprises nucleic acid encod- 
ing the prepro form of an inhibin fi chain and a 
mature ^-chain dimer is recovered free of the a- 
chain. 

35 10. The method of claim 5 or B wherein the 4 

chain is the 4a chain and the inhibin is present in a 
concentration greater than about 20 ng/ml in the 
culture medium. 

11. The method of any preceding daim 
40 wherein the nudoic a^d encodes an amino add 

sequence variant of the inhibin a and/or fi chain. 

12. The composition comprising human or por- 
dne inhitjin which is completely tree of unidentified 
or pordne proteins. 

4S 13, A composition comprising an a or in- 

hibin prodomain unassodatad with native 
gfycosyfation. 

14. A compositton comprising a homodimer of 
mature inhibin 4a or 0^ chains, which composition 

so is free of the inhibin a chairu 

15. A composition comprising a heterodimer of 
mature inhibin 4a with mature inhibin 4b* which 
composition is free of the inhibin a chain. 

16. Non chronrKJSomal DNA encoding inhibin a 
55 or an inhibin 4 chain. 

17. The DNA of claim 16 which is free of 
intervening untranslated sequences. 
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18, The DNA of ciaim 16 cr 17 which encodes 
porcine or human inhibin a or ^ chain. 

19, The DNA of claim 16.17 or 18 which is 
labetied with a detectable moiety. 

20, The DNA of ciaim 16,17,18 or 19 which 5 
encodes an amino add sequence variant of the a 

or ^ Cham. 

21 - A replicable vector comprising DNA encod- 
ing an inhibin a and/or chain. 

22. The vector of claim 21 wherein the DNA io 
encodes porcine or human inhibin a and/or ^ 
chain. 

23. The vector of claim 22 comprising a viraJ 
promoter operably liniced to the DNA encoding the 
inhibin a and/or 0 chains. 75 

24. The vector of claim 21,22 or 23 which 
contains DNA encoding both inhibin a and an in- 
hibin 0 chain. 

25. The vector of claim 21^ or 23 which 
contains DNA encoding an inhibin 0 chain but not 20 
the inhibin a chain. 

26. A host cell transformed with a replicable 
vector comprising DNA encoding the inhibin a 
and/or an inhibin 0 chain. 

27. The cell of claim 26 which is a eulcaryotic 2S 

celt. 

28. The cell of claim 26 or 27 wherein the DNA 
encodes pordne or human inhtbtn a and/or 0 
chain. 

29. A cell-free composition containing an in- 30 
hibin a chain prodomain sequence that is free of 
nnature a chain sequence. 

30. A cell-free composition containing 

a) a polypeptide comprising the inhibin 0a 
chain prodomain sequence HSAAPOCPSCALAALP- 35 
KDVPNSQPEMVEAVKKHILNMLHL, 
PDVTQPVPKAALLNAIRKLHVQKVGeNGYVBEDDl- 

G, AEMNELMEQTSEHTFAESGTARJCOHFBSKEG- 
SDLSWB^AEVWLRKVPKANRTRT- 
KVTIRLFQQQKHPQGSLDTGEEAHEVGLKGERSE- 40 
LU.SEKWDA. STWHVFPVSSSIQFUJLDQGKSSLD- 
VRIACEQCQESGASLVLLG, or naturally occurring 
mammalian amino acid sequence variants thereof; 

b) a polypeptide comprising the inhibin 

chain prodomain sequence CTSCGGFRRPEELGR- 4s 
VDGOFLEAV, HiLSRLQMRGRPNlTHAVPKAAMV- 
TALRKLHAGKVREDGRVaPHLDGHASPGADGQE- 
RVSEIISFAETDGLASSRVRLYFRSNEGNQNLFW- 
QASLWLYUOLPYVLEKGS, VRVKVYFQEQGHG- 
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DRWNMVEKRVDLXRSGWHTFPLTEAIQALFcRGE, 
LNLX)VQCDSCQaJ\\A/PVFVDPGEcSHRPFVWQ- 
ARLGDSRHRl. or naturally-occunring mammalian 
amino add sequence variants thereof; or 

c) a polypeptide free of the mature a chain 
amino add sequence comprising the inhibin a 
chain prodomain sequences KVRALFUDALGPPAV- 
TREGGDPGV. 

HALGGRTHRGSEPEEEEOVSQAILFPATDAS- 
CEDKSAARGLAQEAEEGLFRYMFRPSQHTRSRO 
VTSAQLWFHTGUDRQGTAASNSSB^LL- 
GU^SPGGPVAVPMSLGHAPPHWAVLHLATSAL- 
SLLTHPVLVUXRCPLCTCSARPE- 
ATPFLVAHTRTRPPSGGERA. or naturally-occur- 
ring mammalian amino acid sequences variants 
thereof. 

31. A celWree composition containing an in- 
hibin 0 chain prodomain sequence. 

32. To the composition of ciaim 31 wherein the 
0 chain is the ;9AChain and the composition is free 
of mature 0a chain sequence. 

33. The composition of claim 31 wherein the 0 
chain is the ^schain and the composition is free of 
mature 0b chain sequence. 

34. The composition of daim 30 wherein the 
variant is the corresponding pordne amino add 
sequence. 

36. The composition of any of daims 29 to 34 
wherein the polypeptide ts unaccompanied by na- 
tive giycosyiation, 

36. The composition of any of daims 29 to 35 
which is sterile and wherein the polypeptide further 
comprises an immunogenic polypeptide. 

37. An antibody capable of binding a predeter- 
mined polypeptide of any of daims 29 to 36. 

38. The composition of any of daims 29 to 36 
wherein the polypeptide is conjugated to a detect- 
able group. 

39. The composition of claim 38 w^ierein the 
group is an enzyme, fluorophore or radidsotope. 

40. The composition of any of claims 29 to 36, 
36 or 39 which is rnsoiubiiized by non-covatent 
absorption or covaient cross-linking to a water in- 
soluble support 

41. The composition of any of claims 29 to 36 
or 38 to 40 further comprising a physiologicaliy 
acceptable implantable matrix for confrolled release 
of the polypeptide into the tissues of an animal. 
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